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PREFACE 


This volume is part of a four-volume set that summarizes the research of participants in 
the 1998 AFOSR Summer Research Extension Program (SREP). The current volume. 
Volume 1 of 5, presents the final reports of SREP participants at Armstrong Laboratory. 

Reports presented in this volume are arranged alphabetically by author and are numbered 
consecutively — e.g., 1-1, 1-2, 1-3; 2-1, 2-2, 2-3, with each series of reports preceded by 
a 35 page management summary. Reports in the five-volume set are organized as follows; 
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1998 SUMMER RESEARCH EXTENSION PROGRAM (SREP) MANAGEMENT REPORT 


1.0 BACKGROUND 


Under the provisions of Air Force Office of Scientific Research (AFOSR) contract F4%20-90-C- 
0076, September 1990, Research & Development Laboratories (RDL), an 8(a) contractor in 
Culver City, CA, manages AFOSR’s Summer Research Program, This report is issued in partial 
fulfillment of that contract (CLIN 0003AC). 

The Summer Research Extension Program (SREP) is one of four programs AFOSR manages 
under the Summer Research Program. The Summer Faculty Research Program (SFRP) and the 
Graduate Smdent Research Program (GSRP) place college-level research associates in Air Force 
research laboratories around the United States for 8 to 12 weeks of research with Air Force 
scientists. The High School Apprenticeship Program (HSAP) is the fourth element of the Summer 
Research Program, allowing promising mathematics and science students to spend two months of 
their summer vacations working at Air Force laboratories within commuting distance from their 
homes. 

SFRP associates and exceptional GSRP associates are encouraged, at the end of their summer 
tours, to write proposals to extend their summer research during the following calendar year at 
their home institutions. AFOSR provides funds adequate to pay for SREP subcontracts. In 
addition, AFOSR has traditionally provided further funding, when available, to pay for additional 
SREP proposals, including those submitted by associates from Historically Black Colleges and 
Universities (HBCUs) and Minority Institutions (Mis). Finally, laboratories may transfer internal 
funds to AFOSR to fund additional SREPs. Ultimately the laboratories inform RDL of their 
SREP choices, RDL gets AFOSR approval, and RDL forwards a subcontract to the institution 
where the SREP associate is employed. The subcontract (see Appendix 1 for a sample) cites the 
SREP associate as the principal investigator and requires submission of a report at the end of the 
subcontract period. 

Institutions are encouraged to share costs of the SREP research, and many do so. The most 
common cost-sharing arrangement is reduction in the overhead, fringes, or administrative charges 
institutions would normally add on to the principal investigator’s or research associate’s labor. 
Some institutions also provide other suppon (e.g., computer run time, administrative assistance, 
facilities and equipment or research assistants) at reduced or no cost. 

When RDL receives the signed subcontract, we fund the effort initially by providing 90% of the 
subcontract amount to the institution (normally $18,000 for a $20,000 SREP). When we receive 
the end-of-research report, we evaluate it administratively and send a copy to the laboratory for a 
technical evaluation. When the laboratory notifies us the SREP report is acceptable, we release 
the remaining funds to the institution. 
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2.0 THE 1998 SREP PROGRAM 


SELECTION DATA : A total of 490 faculty members (SERF Associates) and 202 graduate 
students (GSRP associates) applied to participate in the 1998 Summer Research Program. From 
these applicants 188 SFRPs and 98 GSRPs were selected. The education level of those selected 

was as follows: 


1997 SRP Associates, by Degree 

SFRP 

GSRP 

PHD 

MS 

MS 

BS 

184 1 

6 

2 

53 


Of the participants in the 1997 Summer Research Program 90 percent of SFRPs and 13 percent 
of GSRPs submitted proposals for the SREP. One undred and thirty-two proposals from SFRPs 
and seventeen from GSRPs were selected for funding, which equates to a selection rate of 54% of 
the SFRP proposals and of 34% for GSRP proposals. 


1998 SREP: Proposals Submitted vs. Proposals Selected 


Summer 

1997 

Participants 

Submitted 

SREP 

Proposals 

SREPs 

Funded 

SFRP 

188 

132 

20 

GSRP 

98 

17 

4 

TOTAL 

286 

149 

24 


The funding was provided as follows: 

Contractual slots funded by AFOSR 18 

Laboratory funded 22 

Total 40 

Twelve HBCU/MI associates from the 1997 summer program submitted SREP proposals; six 
were selected (none were lab-funded; all were funded by additional AFOSR funds). 
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Armstrong Research Site 

9 

3 

Air Logistic Centers 

31 

5 

Arnold Engineering Development Center 

2 

1 

Phillips Research Site 

30 

10 

Rome Research Site 

29 

12 

Wilford Hall Medical Center 

1 

0 

Wright Research Site 

47 

9 

TOTAL 

149 

40 


Note; Annstrong Research Site funded 1 SREP; Hiillips Research Site funded 6; Rome Research 
Site funded 9; Wright Research Site funded 6. 


The 125 1997 Summer Research Program participants represented 60 institutions. 


Institutions Represented on the 1997 SRP and 1998 SREP 

Number of schools 
represented in the 
Summer 97 Program 

Number of schools 
represented in 
submitted prc^)osals 

Number of schools 
represented in 
Funded Prc^)osals 

125 

110 

55 


Thirty schools had more than one participant submitting proposals. 


Proposals Submitted Per School 



G Submitted 
O Selected 


Number of Proposals 
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The selection rate for the 65 schools submitting 1 proposal (68%) was better than those 
submitting 2 proposals (61%), 3 proposals (50%), 4 proposals (0%) or 5+ propo^s (25%). 
The 4 schools that submitted 5+ proposals accounted for 30 (15%) of the 149 prqxKals 

submitted. 

Of the 149 proposals submitted, 130 offered institution cost sharing. Of the funded proposals 
which offered cost sharing, the minimum cost share was $3046.00, the maximum was 
$39,261.00 v^Tth an average cost share of $11,069.21. 


and Insritution Cost Sharing 


Proposals 

Submitted 

Proposals 

Funded 

With cost sharing 

117 

32 

Without cost sharing 

32 

8 

Total 

149 

40 


The SREP participants were residents of 31 different states. Number of states represented at 
each laboratory were: 


States Renresented, by Proposals Submitted/Selected per Laboratory 


Proposals 

Submitted 

Proposals 

Funded 

Armstrong Laboratory 

31 

5 

Air Logistic Centers 

9 

3 

Arnold Engineering Development Center 

2 

1 

Phillips Laboratory 

30 

10 

Rome Laboratory 

29 

12 

Wilford Hall Medical Center 

1 

0 

Wright Laboratory 

47 

9 


Nine of the 1997 SREP Principal Investigators also participated in the 1998 SREP. 


ADMINISTRATIVE FVAT.DATION : The administrative quality of the SREP associates’ final 
reports was satisfactory. Most complied with the formatting and other instructions provided to 
them by RDL. Thirty-seveb final reports have been received and are included in this report. 
The subcontracts were funded by $992,855.00 of Air Force money. Institution cost sharing 
totaled $354,215.00. 
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TECHNICAL EVALUATION : The form used for the technical evaluation is provided as 
Appendix 2. Thirty-five evaluation reports were received. Participants by laboratory versus 
evaluations submitted is shown below: 



Participants 

Evaluations 

Percent 

Armstrong Laboratory 

5 

4 

95.2 

Air Logistic Onters 

3 

3 

100 

Arnold Engineering Development Center 

1 

1 

100 

Phillips Laboratory 

10 

10 

100 

Rome Laboratory 

12 

12 

100 

Wright Laboratory 

9 

5 

91.9 

Total 

40 

35 

95.0 


Notes: 

1: Research on four of the final reports was incomplete as of press time so there aren’t any technical 
evaluations on them to process, yet. Percem complete is based upon 20/21 =95.2% 

2; One technical evaluation was not completed because one of the final reports was incomplete as of 
press time. Perceiu complete is based upon 18/18=100% 

The number of evaluations submitted for the 1998 SREP (95.0%) shows a marked 
improvement over the 1997 SREP submittals (65%). 

PRCXjRAM EVALUATION: Eadi laboratory focal point evaluated ten areas (see Appendix 
2) with a rating from one (lowest) to five (highest). The distribution of ratings was as follows: 



Rating 

Not Rated 

1 

2 

3 

4 

5 

ft Responses 

7 

1 

7 

62 (6%) 

226(25%) 

617 (67%) i 


The 8 low ratings (one 1 and seven 2’s ) were for question 5 (one 2) “The USAF should 
continue to pursue the research in this SREP report” and question 10 (one 1 and six 2’s) “The 
one-year period for complete SREP research is about right”, in addition over 30% of the 
threes (20 of 62) were for question ten. The average rating by question was: 
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1 

2 

3 

4 

5 


7 

OO 

9 



4.6 

4.6 


4.7 

4.6 

MM 

4.8 

4.5 

4.6 



The distribution of the averages was: 


AREA AVERAGES 



Area 10 “the one-year period for complete SREP research is about right” had the lowest 
average rating (4.1). The overall average across all factors was 4.6 with a small sample 
standard deviation of 0.2. The average rating for area 10 (4.1) is approximately three sigma 
lower than the overall average (4.6) indicating that a significant number of the evaluators feel 
that a period of other than one year should be available for complete SREP research. 
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The average ratings ranged from 3.4 to 5.0. The o\’erall average for those reports that were 
evaluated was 4.6. Since the distribution of the ratings is not a normal distributicm the average 
of 4.6 is misleading. In fact over half of the reports received an average rating of 4.8 or 
higher. The distribution of the average report ratings is as shown: 


AVERAGE RATINGS 

18 
16 
14 
12 
10 
8 
6 
4 
2 
0 

3.0 3.2 3.4 3.6 3.8 4.0 AJ. 4.4 4.6 4.8 5.0 



It is clear from the high ratings that the laboratories place a high value on AFOSR’s Summer 
Research Extension Programs. 
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3.0 St^CONTRACTS SUMMARY 


Table 1 provides a summary of the SREP subcontracts. The individual reports are published 
volumes as shown: 

Laboratory Volume 

Armstrong Research Site 1 

Arnold Engineering Development Center 5 

Air Logistic Centers 5 

Phillips Research Site 2 

Rome Research Site 3 

Wright Research Site 4 
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SREP SUB-CONTRACT DATA 


Report Author Author's 

Author's Ceirersity Degree 


Chubb, Gerald 
Industrial Engineering 98-0829 

Ohio State University, Coluabus. OH 


Sponsoring _ . _ _ Contract Univ. 

Lab Performance Penod Amount Sh 


AL/HR 01/01/98 12/31/98 $25000.00 $ 

Scoring Pilot Performance of Basic "light 
Manuevers 


Univ. Cost 
Share 


Foy, Brent PhD 

Medical Physics 98-0828 

Wrigbt Sute University, Dayton, OH 

Lance, Charles PhD 

Psychology 98-0842 

Univ of Georgia Res Foundation, Athens, GA 

Woehr, Dasid PhD 

Department of Psychology 98-0802 

Texas A A M Univ-Collcge Station, College 

Collins, Frank PhD 

Mechanical Engineering 98-0807 

Tennessee Univ Space Institute, Tullahoma, TN 

Whaley, Paul PhD 

Mechanical Engineering 98-0820 

Oklahoma Christian Untr of Science & Art, 

Balas, Mark PhD 

Applied Math 98-0816 

Univ of Colorado at BouMer, Boulder, CO 

Dune,Neb PhD 

Astrophysics 98-0808 

University of New Mexka. Albuquerque, NM 

Hanson, George PhD 

Electrical Engineering 98-0811 

Univ of Wisconsin • Milwaukee, Milwaukee, W1 

Jeffs, Brian PhD 

Electrical Engineering 98-0813 

Brigham Young University, Provo, UT 

Kar, Arasinda PhD 

Engineering 98-0812 

University of Central Florida, Orlando, FL 

Leo, Donald PhD 

Meebankal & Aerospace 98-0810 

University of Toledo, Toledo, OH 

Liu, Hanli PhD 

Physia 98-0814 

Univ of Texas at Arlington, Arlington, TX 

Bienfang, Joshua BS 

Physics 98-0815 

University of New Mexks. Albuquerque, NM 

Paulson , Eric BS 

Enginecring/Physics 98-0837 

Univ of Colorado at Botider, Boulder, CO 


AL/OE 01/01/98 12/31/98 $25000.00 511278.00 

Development & Validation of a 
rhysiologically-Based Kinetic Model c£ Perfused 

AL/HR 01/01/98 12/31/98 $24989.00 $0.00 

Extension of Job Performance Measurement Tech to 
the Development of a Prototype 

.4L/HR 01/01/98 12/31/98 $25000.00 $11508.00 

Validation of The Multidimensional work ethic 
profile (MKiF) as a screening tco 

AEDC/E 01/01/98 12/31/98 $25000.00 516104.00 

Mcnte Carlo Computation of SpeciesSepaaration by 
a Conical Skiamer in Hypersc.nic 

ALC/OC 01/01A8 12/31/98 $23351.00 $3046.00 

Probabilistic Analysis of Residual Strength in 
Corroded and Dncorroded Aging Air 

PL/SX 01/01i98 12/31/98 $25000.00 $0.00 

Non-Linear Adaptive Control for a Precision 
Deployable Structure with White ligh 

PL/u Qvoim i2/3im $ 25000.00 $ 5777.00 

Image Recovery Using Phase Diversity 

PL/WS 01/0U98 12/31/98 $25000.00 $23250.00 

Perturbation Analysis of the Natural Frequencies 
Targets in Inhomogeneous Media 

PL/U 01/0198 12/31/98 $25000.00 $19177.00 

Bayesian Restoration of Space object Images From 
Adaptive Optics Data with unkno 

PL/LI 01/0198 12/31/98 $25000.00 $5414.00 
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1. RArKGROUNDi 


Research & Development Laboratories (RDL) is uttder contract 


,T 49620 - 93 -C- 0063 ) to the United Steles Air Force to administer the Summer 
Research Program (SRP), sponsored by the Air Force Office of Scientific Research 
uAFOSR), Bolling Air Force Base, D C. Under the SRP, a selected number of college 


faculty members and raduate smdents spend part of the ^er conduming research 
^ Air Force laboratories. After completion of the summer tour participants may 
submit, through their home institutions proposals for follota-on research. The follow- 


on research is known as the Summer Research Extension Program (SREP). 
.Approximately 61 SREP proposals annually will be selected by the Air Force for 
funding of up to $25,000; shared funding by the academic institution is encouraged. 
SREP efforts selected for funding are administered by RDL through subcontracts with 
the institutions. This subcontraa represents an agreement between RDL and the 
institution herein designated in Section 5 below. 


2 Pr>T PAYMENTS: RDL wiU provide the foUowing payments to SREP institutions. 

. 80 percent of the negotiated SREP doUar amount at the start of the SREP 

research period. 

. The remainder of the funds within 30 days after receipt at RDL of the 
acceptable wrinen final report for the SREP research. 


3 rxiCTT-nmON-c: PFSPONSIBILITIES; As a subcontractor to RDL, the institution 
designated on the title page will. 






a. Assure that the research performed and the resources utiUzed adhere to those 
defined in the SREP proposal. 

b. Provide the level and amounts of institutional support specified in the SREP 
proposal.. 

c. Notifi- RDL as soon as possible, but not later than 30 days, of any changes in 
3a or 3b above, or any change to the assignment or amount of participation of 
the Principal Investigator designated on the title page. 

d. Assure that the research is completed and the final report is delivered to RDL 
not later than twelve months fi-om the effective date of this subcontract, but no 
later than December 31, 1998. The effective date of the subcontract is one 
week after the date that the institution’s contracting representative signs this 
subcontract, but no later than January 15,1998. 

e. Assure that the final report is submitted in accordance with Attachment 3. 

f Agree that any release of information relating to this subcontract (news 
releases, articles, manuscripts, brochures, advertisements, still and motion 
pictures, speeches, trade associations meetings, symposia, etc.) will include a 
statement that the project or effort depicted was or is sponsored by: Air Force 
Office of Scientific Research, Bolling .■^FB, D.C. 

g. Notify RDL of inventions or patents claimed as the result of this research as 
specified in Attachment 1. 

h. RDL is required by the prime contract to flow down patent rights and technical 
data requirements to this subcontract. Attachment 2 to this subcontract 
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contains a list of contract clauses incorporated by reference m the pnme 
contract. 


4 --Ml notices to RX)L shall be addressed to, 

RJDL AFOSR Program Office 
5800 Uplander Way 
Culver City, CA »D230-6609 

5 By their signatures belo^v. the parties agree to provisions of this subconract. 


Abe Scpher 

RDL Contracts Manager 


Sisnature of Institution Contracting Official 


Typed/Printed Name 


Date 


Title 


Institution 


Date Phone 




ATTACHMENT 2 
CONTRACT CLAUSES 


This contract incorporates by reference the following clauses of the Federal A^uismon 
Regulations (FAR), vsith the same force and effea as if th,^ were given m&Ute«. Upon 
request, the Contracting OfBcer or RDL will make then full text available (FAR 52,252-2). 
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52.203- 5 

52.203- 6 
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recovery of funds FOR ILLEGAL OR IMPROPER 
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SECURITY REQUIREMENTS 

PROTECTING THE GO\ERN’MENT’S 
INTEREST WHEN SL^CONTRACTING WITH 
CONTRACTORS DEBARRED, SUSPENDED, OR 
PROPOSED FOR DEB.ARMENT 

DEFENSE PRIORITY ANT) ALLOCATION 
REQUIREMENTS 

AUDIT AND RECORDS - NEGOTIATION 

PRICE REDUCTION FOR DEFECTIVE COST 
OR PRICING DATA 
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52.215-12 


52.215- 14 

52.215- 8 
52.215.18 

52.222- 3 

52.222- 26 

52.222- 35 


52.222- 36 

52.222- 37 

52.223- 2 
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52.224- 1 

52.224- 2 

52.225- 13 


52.227-1 
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SUBCONTRACTOR COST OR PRICING 
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CONMCT LABOR 
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52.230-5 
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nrscRiPTlON 

252.203-7001 
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252.233-7004 
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SUMMER RESEARCH EXTENSION PROGRAM 
TECHNICAL EVALUATION 


£REP No; 98-0810 

rrincipal Investigator; DR Donald Leo 

University of Toledo 

Circle the rating level nirr±>er, 1 (low) through 5 (high) , 
you feel best evaluate each statement and return the 
completed form to RDL by fax of mail to: 

ROL 

Attn: SREP Tech Bvals 
S800 Uplander Way 
Culver City, CA 90230-6608 
(310)216-5940 or (800)677-1363 


1. This SREP report has a high level of technical merit. 

Z. The SREP program is important to accomplishing the laib's mission. 

3. This SREP report accomplished what the associate's proposal promised. 

4. This SREP report addresses areals) irt^jortant to the OSAF. 

5. The OSAF should continue to pursue the research in this SREP report. 

6. The OSAF should maintain research relationships with this SREP associate. 

7. The money spent <» this SREP effort was well worth it. 

8. This SREP report is well organised and well written. 

9. I'll be eager to be a focal point for summer and SREP associates is the future. 

10. The one-year period for cooplete SREP research is about right. 


1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 
1 2 3 4 5 


11. If you cculd change any one thing about the SREP program, what would you change: 


12 . What do you definitely NOT change about the SREP program? 


PLEASE USE THE HACK FOR ANY OTHER COMMENTS 

Laboratoryfhillips Laboratory 
Lab Focal PointCapt Jeanne Sullivan 
Office SymbolAFRL/VSDV 


Phone: 


(505)846-2069 
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A STATISTICAL MODEL PREDICTING THE IMPACT OF A SCENE 
PROJECTOR’S NONUNIFORMITY ON A TEST ARTICLE’S IMAGE 


Brian P. Beecken 
Professor 

Department of Physics 
Bethel College 

ABSTRACT 

A statistical model has been developed and used to calculate the contribution of a scene 
projector array to the nonuniformity of a test article’s output image. The output image non¬ 
uniformity is different from the detector array s nonuniformitv because of the nonunifonnity of 
the projector array and the relative positions of emitter images on the detector array as deter¬ 
mined b\ the sampling ratio. These two factors are combined with the nonuniformitv of the 
detector array using standard error propagation techniques. In order to calculate the predicted 
output nonuniformity, a weighting function was developed to determine the contribution of the 
different emitters upon individual detector elements. It is through this weighting function that 
parameters such as the sampling ratio, the fill factor of the detector array, the optical blur of the 
emitters, and the alignment of the emitters with respect to the detectors affect the nonuniform¬ 
ity. A computer program has been written to numerically approximate the weighting function 
for a user defined set of parameters. The model presented in this paper represents a significant 
improvement over one reported earlier.[1] The new version includes noninteger sampling ratios 
and contains a number of provisions for possible experimental concerns. Such issues include 
detector offset nonuniformity, diffraction, uncertainty in projector nonuniformity, and spatial 
droop. 




A STATISTICAL MODEL PREDICTING THE IMPACT OF A SCENE 
PROJECTOR’S NONUNIFORMITY ON A TEST ARTICLE’S IMAGE 


Brian P. Beecken 


Introduction 

The KHILS facility at Eglin AFB tests infrared imaging sensors in the lab by projecting simulated 
IR scenes that subtend the sensor’s field of ^-iew.[2] The IR projector, an integral component 
of the simulation testing, is produced under the WISP (Wideband Infrared Scene Projector) 
program. 

The WTSP projector consists of an array of emitters at least as large in number as the array 
of detectors on the sensor’s focal plane. [3] Unfortunately, the fact that the projector consists of 
an array of emitter pixels will cause some nonuniformity in the test article’s output image. This 
circumstance occurs for tvi'o reasons. First, when the emitter array is set to produce a uniform 
IR scene, there is significant nonuniformity in the array’s output.[4] Although this nonuniformity 
can be corrected to a large extent, it can never be completely eliminated. Second, unless there 
is precisely the same number of emitters for every detector element, some detectors will “see” 
more emitters (or at least larger portions of them) than other detectors. Thus, the pixelized 
nature of the projector will exist as an artifam in the test article’s output image. This artifact is 
solely the result of the simulation, and represents a degradation of the actual scene that would 
be encountered during the sensor’s mission. 

The goal of this report is to predict analytically how the nonuniformiU' of the projector 
impacts the nonuniformity of the test sirticle's output image. A statistical model for this pre¬ 
diction has been developed previously. [1] The present work represents a significant extension of 
that model so that it includes non-integer sampling ratios. Also considered are detector offset 
non-imiformity, the difference between a Gaussian approximation and Fraunhofer diffraction, 
the impact of inaccurate knowledge of the projector nonuniformity, and the effect of spatial 
droop on the model. Such information will be valuable for determining both the realism of the 
simulation and the nonuniformity correction requirements. These topics also coincide wth those 
discussed in the proposal for the grant which funded this work. [5; In order to facilitate the use of 
this report, each of these topics has its own subsection. The exception is non-integer sampling 
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ratios, which are such a central part of the statistical model that they can not be separated out. 

In order to make om" model as general as possible, a number of parameters will be required. 
These parameters are the projector’s nommiformity, the detector array’s nonuniformit}-. the 
linear ratio of the emitters to detectors (i.e., the sampling ratio), the relative size of the optical 
blur on the detector array, the alignment of the emitters relative to the detectors, and the fill 
factor of the detector array. These are behev'ed to be the test parameters of primary importance 
and will be incorporated into the foUowng analysis. 

1 Theory 

1.1 The Problem of Positional Nonuniformity with No Blur 

The sampling ratio n:m is defined as the ratio of emitters n to detectors m when counted in one 
dimension. Non-integer sampling ratios cause a type of nonuniformity that we will call positional 
nonuniformity. This type of nonunifonnity will increase the nonuniformity of the test article’s 
output image and can occur any time there is a non-integer sampling ratio. It has nothing to 
do with the emitter array's nonunifonnity. It is due solely to the fact that if the sampling ratio 
is non-integer, then in the vast majority of cases, there will not be an equal number of emitter 
images over each detector element. 

In order to illustrate this issue, consider a non-integer sampling ratio of 4:3. Because we are 
considering the case of no bl\ir, it is appropriate to represent the image of the point emitters 
as small, solid dots as shown in Fig. 1. The detector elements are represented as open squares. 



Figme 1: Illustrations of a samphng ratio of 4:3 with essentially “no blur.” 


Clearly, in Fig. la one detector element has four times as much incident flux as some of the other 
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detectors. This arrangement leads to an inherent nonuniformity even if each emitter is producing 
the same amount of radiation. A translation of half an emitter spacing (diagonally) produces a 
different pattern as shown in Fig. lb, an alignment that despite having less nonuniformiw still 
falls far short of being uniform. 

Careful consideration of these and many other sampling ratios and alignments leads :o the 
edmost obvious conclusion that the patterns will repeat in each direction after m detectors. 
Thus, the positional nonuniformity pattern will always have a minimum pattern size that is 
m X m. We will refer to this square as a chunk, one two-dimensional spatiad period of the 
nommiformity pattern across the detector array. A calculation of the positional nonunifermity 
for this chunk gives the positional nonuniformity for the entire FPA. An algorithm was de-^eloped 
and a computer routine written to rapidly calculate the positional nonuniformity for l large 
number of non-integer samphng ratios. This calculation was performed for the case wh-^re the 
emitters are spaced as far from the edge of each detector element as possible, such as in Fig. la. 
Many interesting features are apparent in the plot of this calculation shown in Fig. 2. Obviously, 
the nonuniformity goes to zero when the sampling ratio is an integer. In such cases them is no 
positional nonuniformity because each detector has the same number of emitters positioned 
above it. Second, the nonurdformity reaches local maxima between each integer sampling ratio. 
Third, the maximum positional nonuniformity occurs nearly halfway between the nearest integer 
ratios.^ Finally, the effects of positional nonuniformity decrease as the samphng ratio increases. 

It turns out that in this kind of analysis alignment plays a very imponant role. If an emitter 
is aligned with each corner of the chunk, as for example in Fig. lb, then rather than a smooth 
trend, sHght changes in samphng ratio cause large and seemingly unpredictable discrete jumps 
in the nonuniformity (see Fig. 3). In general, those sampling ratios which are the simplest 
fractions have the most significant drop in nonuniformity with a change in alignment. Tae two 
plots shown here represent those alignment cases that have the most positional nonunirormity 
and the least. 

1.2 The General Case 

We wiU now calculate how the test article’s nonuniformiU' depends on detector array aonuni- 
formity. emitter array nonuniformity, and positional nonuniformity. The calculation wili Lead to 

^As the sampling ratio is increased, each successive maximum approaches more closely to the halfmy point 
between the nearest integer sampling ratios. 
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Figure 2: Plot of maximum positional nonuniformity with essentially “no blur.” 


a general equation that is valid regardless of blur size, alignment, or other such parameters. 

The output signal u of a detector is often approximated by 

V = gr]^ + b ! 1) 

where g is the detector's conversion gain, g is the detector’s quantum efficiency, $ is the average 
number of incident photons per integration time, and b is the signal offset that exists even for 
zero incident photon flux. The output signal v can only be represented by a linear function 
over a limited region. It is this region, however, for which two point calibration (offset and gain 
correction) is performed. In this paper, we will only be considering those situations in which 
nonuniformity noise dominates over the other noise sources. 

Each detector on the array produces a signal that differs from the other detectors if either the 
conversion gain or the quantum efficiency of the detector varies, or if the incident photon flux is 
different. We are not interested in whether a variation in signal is due to a detector’s conversion 
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Figure 3: Plot of minimum positional nonuniformity with essentially “no blur." 


gain or quantum efficiency. Therefore, these terms will be combined together to represent the 
detector’s sensitivity gr]. The nonuniformity of the detector array can then be conveniently 
represented as the standard deviation of all the detector sensitivities di\'ided by the average 
sensitivity, Similarly, the nonuniformity of the emitter array can be represented by 

where 0 is the average emittance across the emitter array and cr^ is the standard deviation. 

Equation (1) is a product of two terms (detector sensitivity and incident flux) that each have 
an uncertainty. The standard error propagation equation in this situation is 



So far everything has been straightforward: however, the difficulty at this point is calculating 
the ratio which represents the nonuniformity of the radiation incident on the detector array. 
Obviously this will depend to some extent on the emitter array nonuniformity but it will 
also depend on the positional nonuniformity discussed above in Subsection 1.1. First, let us 
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carefully define some terms: 


= number of photons incident on detector i,j during the integration time. 
4)~yS = number of photons emitted by emitter 7 ,6 during the integration time. 

ujys = weighting function denoting the fraction of the total fiux from 

a particular emitter 7,5 which lands on a particular detector i,j. 


With these definitions, the total flux incident on a particular detector is given by 

^75^7(5- (4) 

7,5 

Using standard error propagation technique, we have 



which reduces to 

4., = E <.] • (6) 

Each indi\'idual emitter’s flux is approximated by the average flux ^ of all the emitters. 
Similarly, all the uncertainties in the emitted flux are the same, because they are all 
represented by the same standard deviation of the emittances across the emitter array. Thus, 
= CF^. Furthermore, because the weighting function will be somewhat different for every 
detector i,j in the chunk, the sum of the weighting functions squared will be slightly different. 
Therefore, we must take the average of the summation that appeairs in the second term. Then 
the calculation will give the appropriate cr|, for all detectors in the chunk: 

4 = + ( 7 ) 

7,5 7,5 

It is well-known that the sum of the variances is equal to the variance of the sum. Thus, 

E (8) 

7,5 

In this equation, the first term represents the positional nonuniformity and the second is the 
emitter nonuniformity. Realizing that the value for the average flux incident on a detector should 
be 

^ = <AE‘*^')'«’ (9) 

7,5 
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we can substitute Eqs. (8) and (9) into Eq. (2), 
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(10) 


This result is very satisfying because the three contributions to the output image nonuniformity— 
detector array nonuniformity, emitter array nonuniformity, and the nonuniformity arising from 
the relative positions of the emitters to the detectors—are all easily identifiable. With the ex¬ 
ception of the weighting factor, the noise terms simply add. The weighting function is critical 
to using this equation. The determination of it will be discussed in Section 2. 


1.3 Detector Offset Nonuniformity 

One of the concerns that has been raised about our approach is the use of Eq. (1). It has been 
correctly pointed out that each detector element will not only have a different conversion gain 
g and quantum efficiency rj, but will also have a different signal ofeet b. Thus, at first glance it 
would seem that we are only accounting for two of the three sources of performance differences 
between detector elements. 

Such a concern is not relevant if all detector elements have undergone a two-point calibration 
zind are being operated within the reinge of the cedibration. Two-point cafibration sets all 
detectors to give the same output signal at two points, one at a low incident flux and the other 
at a high incident flux. The first point is the offset. The amplifier gain is then set so that all 
detectors have the same signal response at the second point. Often this cafibration is referred 
to as “offset and gain correction.” Thus, after cafibration each detector should have the same 
^•alue of b. Why then is there nonuniformity in gg? Each detector will respond to increased 
incident flux in different ways. Only when the increased flux is the same as the second point 
of the cafibration do all the detectors have the same response. If all the detectors had a linear 
response, then the cafibration would eliminate all detector nonuniformity. Since the detectors’ 
behavior is nonlinear, the detector array only exhibits uniformity at the two points at which the 
cafibration was performed. This situation is iUxistrated in Fig. 4. 

The use of to represent the uncertainty of a particular detector’s response is an appropri¬ 
ate representation of the various responses possible when the incident flux levels are between the 
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Incident Flux <I> 


Figure 4; Illustration of two point calibration for four different detectors. Only one of the detec¬ 
tors is linear. The different detector responses between the two calibration points is quantified 
by cjgr,. 

caUbration points. Outside of this region, neither this paper’s calculation of expected nonuni¬ 
formity nor the calibration itself is ^ulid. If, for instance, someone were to measure the array’s 
nonuniformity below the offset, significant and unpredictable nonuniformity would result. 

2 Procedure for Determining the Weighting Function 

In the previous section, we derived Eq. (10), which predicts the nonuniformity of the detector’s 
output image. In order to use this equation, however, it is necessary- to know the weighting 
function ujjs as defined in Definition (3). Unfortunately, the weighting function cannot be solved 
analytically. In this section, we -R-ill describe om: numerical approach to finding the weighting 
function. We chose to approximate the distribution of the flux incident on the FPA from a single 
emitter with a two-dimensional Gaussian function. The validity of this approximation will be 
discussed in the second part of this section. 
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2.1 Numerical Calculation of the Weighting Function 


The portion of the emitter’s flux that lands on a detector is determined by integrating the two- 
dimensional Gaussian over the detector’s area. The fraction of each individual emitter’s total 
flux that is incident on a particular detector i,j is what we have defined as the weighting 
function.^ The average number of emitters per detector is simply the square of the sampling 
ratio, (^)^- Therefore, on average, as the fill factor of the detector array goes to 100%, the sum 
of the weighting function must approach this value; 



( 11 ) 


In this equation, the sum is over all emitters near enough to a particular detector to contribute 
incident flux. The average is taken over all detectors in the chunk. 

Although the idea behind the calculation of the weighting function is relatively simple, it 
becomes difficult in practice for three reasons. First, since we cannot analytically solve the 
integral of the two dimensional Gaussian over square areas, we approximated it numerically by 
doing Riemann sums. Second, the equation for becomes rather complex once parameters 
such as fill factor and alignment are taken into account. Third, the weighting function will 
be different for e\-ery detector i,j within the chunk because the positioning of the emitters 
with respect to the detector is different for each detector. Our approximation for the weighting 
function is; 


Ft.Fv 


c c - 


2-(.2616A'C)2 5iS' 



2 


.2616/C 


.2616/C 


( 12 ) 


The integration of the two-dimensional Gaussian is approximated by using a number of 
“detection points’" distributed across the active detector area. The total number of detection 
points per detector is C^. The value of the Gaussiam at each detection point is multiplied by 
the fraction of the detector’s area being represented by the detection point {Fx and Fy 
axe discussed below). The sum of these \-alues \ields a good approximation of the integral, 
assuming that a large enough value of C has been chosen. The computer program we developed 
to calculate will automatically choose an appropriate value for C when given \-alues for the 

^This definition of the weighting function represents a significant departure from the definition employed in 
our previous work, Ref. 1. 
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sampling ratio ^ and the relati\'e optical blur size K (discussed below). Cleeirly, a larger C will 
provide a better approximation, but computer processing time increases rapidly with C. 

The relative optical blur size K is the size of the optical blur relative to the detector elements 
on the FPA. We have defined K to be the number of detector pitches in the diameter of a circular 
b.ur containing 83.9% of the radiation. It has previously been shown[ll that K can be found 
experimentally using 

^ = ( 13 ) 

where / is the focal length, A is the wavelength of the incident radiation, w is the detector 
pitch, and D is the diameter of the system’s aperture. It has also been shot^m in Ref. 1 that 
the (7 of the Gaussian can be written as 0.2616Rr. This form appears in the weighting function 
Eq. (12) in place of the usual a in the two-dimensional Gaussian, both in the exponent and in 
the normalization factor. 

In Eq. (12), Tx and Ty accoimt for the alignment of the emitter array relative to the FPA. 
Each variable represents a translation in the x or y direction measured as a fraction of the 
emitter spacing. Zero translation is defined as one emitter being located precisely above each 
corner of the chunk of detectors.^ If the values for translation range firom 0 to 0.5, all possible 
patterns will be covered."* The detector’s fill factor is accounted for by Fx and Fy, which are 
the linear fill factors in the x and y directions. Each must be in the range of 0 to 1, where 1 is 
a 100% linear fill factor. 

2.2 Gaussian Distribution versus Fraunhofer Diffraction 

\\'hen light passes through the aperture of an optical system, it undergoes Fraunhofer diffraction. 
Clearly, a two-dimensional Gaussian distribution is only a rough approximation to the central 
maximum and the surrounding fringes. The existence of the central bright spot, or Airy disk, is 
acknowledged widely enough that it seemed to be the most appropriate measurement for blur 
rize. Consequently, in the preceding subsection we defined K in terms of the diameter of a blur 
containing 83.9% of the incident radiation, the same percentage that exists in the Airy disk.[6] 
Cnfortunately. there is no such thing as an Airy disk for the Gaussian that we have used, and 
50 this criterion is really rather arbitrary. 

^This definition of zero translation represents a significant departure from the definition employed in our 
previous work, Ref. 1. 

‘'in many cases the pattern will repeat with a translation smaller than 0.5. 
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In response to some concerns expressed at AeroSense 1998, and in an effort to obtain the 
most accrirate calculation of nonuniformity possible, a significant amount of efifort was devoted 
to stud\ing Fraunhofer diffraction in the present context. Fraunhofer diffraction is normally 
expressed mathematically in terms of Bessel functions. The Gaussian form of the weighting 
function was replaced with an appropriate expression using Bessel functions. The weighting 
function could then be easily used to determine what fraction of the radiation from a single 
emitter falls on all the nearby detectors. This calculation is equivalent to turning on only one 
emitter and looking at its blur with a detector array. 

Interestingly enough, the predicted numbers for the nearby detector elements were such that 
it was very difficult, but not impossible, to discern a diffraction pattern. As would be expected, 
the pattern became somewhat more obvious as the resolution was improved by enlarging the blur 
so that many detector elements were involved. The circular nature of the diffraction pattern was 
at least partially obscured by the use of square detectors. In addition, it is important to keep 
in mind that because 83.9% of the radiation is in the central spot, relatively little radiation is 
left to define the diffraction rings. The implication of these calculations is that it would be very 
difficult to experimentally observe a diffraction pattern unless the number of detector elements 
per emitter were unrealistically large. 

The form of the weighting function using Bessel fimctions was incorporated into the computer 
model. For large blur sizes {K > 4) there were no discernable differences from the usual model. 
The results for small blur sizes, however, were disturbingly inconsistent. The problem increased 
as the blur size decreased. Eventually, the difficulty was traced to the fact that Bessel functions 
are only a valid representation of Fraunhofer diffraction in the small angle approximation. A 
smaller blur size is the equivalent of putting the detector array closer to the aperture. In that 
configuration, the angles to the detection points become too large to meet the criteria of a small 
angle approximation. Thus, except for large blur, the Bessel functions are not a valid way of 
predicting the distribution of the radiation on the detector array. Because this is a problem for 
blur sizes of practical interest (0.5 <K < 2), it was decided that the Gaussian approach to the 
weighting function was the better approximation. 

One more point about Fravmhofer diffraction m\ist be made. Even if the detector array 
were very large so that there were plenty of detector elements available to resolve the diffraction 
pattern, diffraction would still be extremely difficult to observe. The reason is that the diffraction 
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pattern is most easily seen when the radiation source is monochromatic. Since the emitters are 
multispectral, not monochromatic, the diffraction pattern must be calculated owr the entire 
wavelength bandpass. Such a calculation would be very difficult, if not impossible, because 
the relative intensities of the different wavelengths would be governed by a Planck blackbody 
distribution, which in turn would depend on the temperature of the emitter. Nevertheless, an 
important generalization can be made. If the bandpass is large enough to include a doubling of 
the wavelength, then there will be no discemable diffraction pattern. Doubling the wavelength 
of the radiation will slightly more than double the diameters of the diffraction rings. Thus, the 
bright rings of one wavelength will overlap with the dark rings of another wavelength. 

The multispectral nature of the radiation is the primary reason that diffiraction will not be 
observed, and it explains the recent experimental observation that the shape of the blur “is not 
Gaussian or Airy-diffraction like, but looks more like a narrow Gaussian added to a broader 
Gaussian.” [7] This description fits the expected appearance of a multispectral diffraction pattern. 
More work needs to be done in this area. Clearly, the weighting function and the present model 
for calculating nonuniformity would be improved with a function that better describes the true 
distribution of the radiation incident on the FPA. 

3 Results 

The goal of this section is to present the results of our calculations using the statistical model as 
represented by Eq. (10) and the weighting function, Eq. (12). Unfonunately, this is a daunting 
task because there are eight different inputs to the model: detector nonuniformity, emitter 
nonuniformity, sampling ratio, blur size, translation in two direcrions, and fill factor in two 
directions. We have attempted to choose those plots that will best represent general trends of 
relevance to the work going on at the KHILS facility. 

First, consider Fig. 5. This plot is made with the same alignmeni eis Fig. 3 and as illustrated 
for a particular sampling ratio in Fig. lb. However, this time the full model is used so we have 
selected a detector nonuniformity of 0.004 and a projector nonuniformity five times larger than 
that. The major difference, though, is that optical blur is now taken into account. Clearly, 
as the blur size increases, the effect of positional nonuniformity greatly diminishes. Blurring 
reduces positional nonunifoimity so much that another plot with a different translation would 
be verj- similar to this one. 
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Figure 5: Output NU versus Sampling Ratio with Det. NTJ == 0.004, Proj. NU = 0.02. Tx = 
Ty = 0.0, and Fx — Fy = 1. 

Figures 6-12 show output image nommiformity as a function of projector and detector non- 
uniformity. The first one, Fig. 6, has an integer seunpling ratio of = 2. The result is essentially 
equivalent to our previous work[l] and shows vi\Tdly how much more strongly the output de¬ 
pends on detector nonuniformity than emitter nonuniformit}' because of the weighting factor 
in Eq. (10). Fig. 7 is imchanged except for choosing a sampling ratio of 2.15. This sampling 
ratio corresponds to one being used at KHILS during July, 1998, but it is close enough to the 
integer 2 that the positional nommiformity does not have much of an efiect. Close examination, 
however, will show that at the origin, where there is no emitter or detector nonuniformit>'. there 
is a very small output nonuniformity. 

Figure 8 also has a sampling ratio that was t\pical during July, 1998. Even though ^ = 1.04 
is quite close to an integer, there is significant positional nonuniformity, which causes the output 
image nonuniformity to be significant even at the origin. The reason this noninteger sampling 
ratio is so important is because it falls between the integers 1 and 2, a region of maximum 
positional nommiformity (cf. Fig. 5). Figure 9 has the .samp ling ratio | which causes the largest 
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Figure 6; Output NU versus Detector NU and Projector NU with n/m = 2, K = Tx = Ty — 
0.0, and Fx — Fy = 1. 

positional nonuruformity possible. Figures 10-12 were created under the same circumstances as 
Figures 7—9 except that the blur size has been increased from K — 1 to K = 1.5. 

Figures 13 and 14 plot output image nonuniformity as a fimction of translation in the diagonal 
direction (T* = Ty) for a few different sampling ratios. The first plot has the smaller blur; 
obviously it has more nonuniformity. Interestingly, the translation can make a big difference for 
certain sampling ratios. Incidentally, the sampling ratio of 1.5 which causes the large arch in 
the first figure also causes an arch in the second. However, in the second plot the height of the 
arch is so little that it does not really show up on this scale. 

Figmes 15-18 show the output nonuniformity as a function of blur size for different fill 
factors. The fill factors indicated are the fraction of fill in one direction, but the fill is being 
changed in both directions simultaneously (eg. F = 0.9 is an area fill factor of 81%). Each 
successive graph is for a greater sampling ratio. The first and last graphs have sampling ratios 
that were being used in the lab when this work was done. Figure 16 was chosen because its 
sampling ratio causes maxim-mi positional nonuniformity and Fig. 17 was chosen because it is 
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Figure 9: Output NU versus Detector NU and Projector NU with njm = 1.333, K = 
Ti = Ty = 0.0, and F;r. = Fy = 1. 



Figure 10: Output NU versus Detector NU and Projector NU with n, m = 2.15, K = I 
Tx = Ty = 0.0, and Fx = Fy = 1. 
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Figure 16: Output NU versus Relative Optical Blur Size with Det. XU = 0.004, Proj. NU 
0.02, n/m = 1.333, F, = Ty = 0. 
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Figure 17; Output NU versus Relative Optical Blur Size with Det, NU = 0.004, Proj. NU 
0.02, n/m = 1.5. Tr. = Ty = 0. 
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Figure 18: Output NU versus Relative Optical Blur Size with Det. NU = 0.004, Proj. NU 
0.02, n/m = 2.15, = Ty = 0. 
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Figure 19: Output NTJ versus Relative Optical Blur Size with Det. NU = 0.004, Proj. NU 
0.02. n/m = 1.04, Fz = Fy = 1. All translation lines overlap. 
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Figure 20: Output NU versus Relative Optical Blur Size with Det. NU = 0.004, Proj. NU 
0.02. n/m = 1.333. F^ = Fy = 1. AH translation lines overlap. 
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the one noninteger sampling ratio in this range (for this alignment) which has no positional 
noLuniformity. It should be reminiscent of last year’s work. Figures 16 and 17 are both a little 
surprising m theu behavoir at small blur. The reason is that at the no-blur hmit, emitters 
effectively drop off the edge of the active area of the detectors. Many times this results in one 
emitter being seen by only one detector—thus the nonuniformity becomes that of the emitter 
axraj (which in this case dominates the detector array nonuniformity). 

Figures 19 and 20 show output nonuniformity as a function of blur size for different trans¬ 
lations. The two sampling ratios chosen for these two graphs happen to be ones that cause 
translation to have very little effect. Each would appear as one of the straight lines in Figs. 13 
or 14. Nevertheless these plots are useful, partly because they provide a different pespective, 
and partly because they again show the effect of positional nonuniformit>-. Figure 20 curves up¬ 
ward more steeply at small blur because that particular sampling ratio causes more positional 
nonuniformity. However, at larger blur sizes, where positional nonuniformity is not much of a 
factor, it has slightly less nonuniformity because the sampHng ratio is higher. 

4 Experimental Concerns 

Due to the nature of a model or a calculation, there are always going to be questions regarding 

Its relevance to experimental results. This section attempts to address those issues that are most 
likely to cause concern. 

4.1 Accuracy of the Measurement of the Projector’s Nonuniformity 

Experimentally, the determination of a projector’s nonuniformity is a difficult task. Therefore, 
of the parameters that are entered into the statistical model, the projector’s nonuniformity ^ 
prooably has the largest error. How much uncertainty does this create in the calculated test 
antcle’s output nonuniformity An equation predicting this contribution can be found using 
standard error propagation techniques: 




Using Eq. (10) and doing the partial derivative yields 



( 15 ) 


which reduces to 








16 


(16) 


t (^) 

It is logical to assume that the average emittance <j) is well-known compared to the uncertainty 
a a- If that is true to the extent that all the uncertainty is in the measurement of the uncertainty, 
then 

(17) 




Substituting this approximation into Eq. (16) gives a simpler form, 
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(18) 


Thus, if the uncertainty in the projector nonuniformity can be estimated, then its effect on the 
accuracy of this model can be calculated. 


4.2 The Point Emitter Approximation 

Perhaps the greatest discrepancy between this model and the experimental situation exists 
because of the use of point emitters in the calculation. In reality the emitters are extended 
sources. The most recent published description of a WISP array lists a 309o fill factor for the 
emitters.[8] Obviously, this is not quite a point source, but then neither is it a 100% fill factor 
array. 

It would seem ob\dous that for this model the difference between a point source and a 30% 
fill factor extended emitter will make essentially no difference when blm sizes of K > 2 are 
used. Since the sampling ratio is alwa^'s at least unity, K = 1 means that the center of the 
image of a point source will be positioned within the Airy disks resulting from its neighboring 
emitters. The true extended nature of the emitters will be obscured by the blur. As the blur size 
is reduced, however, the difference between an extended emitter aind a point somrce will become 
more important, but larger sampling ratios will still minimize the difference. When AT < 1, the 
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Airy disk will be smaller than the detector spacing, so for seimpling ratios approaching uni ty 
the difference could be significant. 

The main implication of the point source approximation is that it makes the model give 
\^lues for nonuniformity which must be worst case. Clearly, the more isolated the emitters, 
the greater the chances of nonuniformity. The difference bet^'een extended emitters and point 
emitters will show up primarily in the positional nonuniformity. If the emitters represented 
by dots in Fig. (1) were larger, then the radiation would be distributed more evenly between 
the detector elements. This reduction in the positional nonuniformity is very similar to that 
observed as blur size is increased. 

The use of a point source approximation is not an inherent Hmit to the approach used by this 
model. With some effort, it should be possible to incorporate extended emitters in the model. 
Each emitter would have to be broken into a series of point emitters in much the same manner 
that the detector elements are already di\ided into detection points. 

4.3 Spatial Droop 

Spatial droop (also called busbar robbing) is the term used for the variation in fiux from emitters 
due to a variation in voltage from one emitter to another. The voltage variation is caused by 
generating bright scenes which require a great deal of current and thus cause a voltage droop 
across the array. [9] A study of how spatial droop would affect this model has been initiated. 
The results are interesting and so far seem to be reasonable. However, these thoughts should be 
considered preliminary at this point. 

The goal of this section is to analyze how the nommiformiU' due to spatial droop is combined 
with other sources of nonuniformity. If these calculations are correct, then experimental data 
can be used to estimate the variation of the voltage. It should also prove possible to subtract 
from the experimentally measured nonuniformity that part of the nonuniformity caused by a 
gentle spatial droop across the array. 

The power dissipated from a resistor R with a voltage V apphed across it is If the emitter 
is approximately a blackbody, this power equals the area of the resistor times where T is 
the temperature in kelvins and a is the Stefan-Boltzmann constant. 

The number of photons emitted within the detector’s bandwidth during the integration time 
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where B is a constant which converts from total dissipated power to photon flux, and / is the 
fraction of the flux within the bandwidth of the detector. If the emitter is a blackbody, 
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Since w^e want to distinguish changes due to variations in voltage from fluctuation in the 
resistances themselves, it is helpful to write 
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( 21 ) 


where the resistor has resistance Ro at V^. and it is assumed that the voltage variations are 
small enough that only the hnear term is reeded. Inserting Exj. (21) into Eq. (19) yields 

V^2 


(f) = Bf - 

The standard error propagation equation can now be used: 
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First, we need to calculate the partial derh-atives: 
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and, taking ad\-antage of Eq. (22) along with a little algebra. 
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Putting these results into Eq. (23), yields 
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In this equation, the first term on the right, > represents the portion of due 

to the \‘ariation in emitter resistances. Tae nonuniformity of the projector array is normally 
understood to be this term. The second term in Eq. (26) accounts for the nonuniformity caused 
by spatial droop. 


1-27 




Maxlow' et al.[10] used an Amber AE4128 camera to analyze an emitter array. The camera 
had a 3-5/im bandwidth, and the apparent temperatures used were from approximately 300K 
to 350K. They state that when all 256 x 256 emitters were turned on to maximum value, the 
peak-to-\'alley difference divided by the average radiance was less than 10%. 


If the emitter is acting as a blackbody, then crT'* is proportional to so a 1% change 
in V produces a 0.5% change in T. At 350K, a 1% change in T (2% change in V) produces 
approximately a 5.5% change in /. Therefore, near 350K, for a blackbody emitter. 


Therefore, near 350K, 
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The third term in the square brackets depends on the coefficient of linear expansion 

and the temperature coefficient of resistivity of the material. For the silicon-nitride wafer resis¬ 
tors described in Ref. 8, this term shoiild be negligible compared with the other terms in the 
brackets. 2.75 and 2. Therefore, 



This equation implies that the 10% variation in radiance reported in Ref. 10 would have resulted 
from approximately a 2% variation in the voltages across the array. 

This result becomes important to the statistical model when scenes that are predominately 
bright are being projected by the emitter array. In such a case, a gradual droop is expected 
across the projector array. This droop will show up as a large nonuniformity in the standard 
measurements. However, a gradual droop will have little impact on the projector’s abihty to 
display fine features with the appropriate contrast for testing. 

Equation (29) provides a method of correcting the measured nonuniformity ^ for spatial 
droop effects because the final term represents the nonuniformity due to droop. Only the 
term really matters for the determination of nonuniformity which will degrade the simulation. 
This resistor nonuniformity can be calculated by subtracting the last term from the measured 
nommiformity. The problem with this technique is that ^ is not likely to be known. An 
equation could, however, be written that would model the voltage droop across the array and 
then the nonuniformity of the voltage could be calculated. 
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4.4 Rotation 


Some preliminary effort has been put into considering the effects of rotating the FPA relative to 
the projector array. So far it seems that rotation will effect both positional nonuniformity and 
the contribution of emitter airray nonuniformity. It does look like the model can be successfully 
made to include rotational effects. It is clear, however, that computational time will skyrocket. 

Conclusion and Recomendations 

The 1997 statistical model[l] has been sucessfully expanded and refined to account for non¬ 
integer sampling ratios. During this work, the phenomenon of positional nonuniformity was 
identified, and a method for quantifying it was devised. Positional nonuniformity exists in¬ 
dependent of detector and emitter nonuniformity. It is caused by different detector elements 
“seeing” the images of different numbers of emitters. The variation from detector to detector 
occius in a systematic and predictable manner with a two-dimensional spatial period we have 
chosen to call a chunk. Positional nonuniformity occurs for virtually aU non-integer sampling 
ratios, but decreases dramatically as integer sampling ratios are approached. Furthermore, the 
general trend in positional nonuniformity is smaller as the sampling ratio is increased, despite 
large local maxima between integer values. As the optical blur is increased, the effects of posi¬ 
tional nonuniformity decrease significantly. However, the model currently does predict significant 
nonuniformity in at least part of the optical blur range applicable to the actual experimental 
situation. 

All of the other issues in the proposal[5] for this effort have been addressed. Section 1.3 
discusses the concerns about offset nonuniformity and hopefully puts them to rest. .4 method 
for determining how the uncertainty of the model’s output depends on the rmcertainty of the 
important and hard to deter min e emitter nonuniformity is derived in Section 4.1. Significant 
progress has been made in identifying a means of handling spatial droop in the context of this 
model. In fact, the results of Section 4.3 provide a method for separating the effects of different 
emitter resistances from the \ariation in voltage applied to them because of droop. The last 
issue the proposal addressed was the effects of rotation of the detector array relative to the 
emitter array. This issue is still open, but enough preliminary work was done to determine that 
the nonuniformity will be affected by rotation and that quantifying its effect should be possible 
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as a logical extension of the current model. 

As part of this research, considerable effort was put into a consideration of the effects of 
diffraction and the relative validity of a two-dimensional Gaussian as an adequate approximation 
for the optical blur or point spread function. It is readily admitted that the Gaussian is only 
a rough but convenient approximation. It was also determined that a Fraunhofer diffraction 
pattern will not be observ'ed, primarily because of the multispectral nature of the radiation and 
partly because of the limited resolution. For now, the most expedient approach was to continue 
with the Gaussian approximation, but if this model is to be developed further, then this issue 
should be revisited. 

At this point there seems to be a number of things yet to do to strengthen and extend :he 
current model: 

• As discussed above, the use of the Gaussian approximation should be improved u::on. 
T'wo possible approaches come to mind. First, careful examination of appropriate data 
could lead to the formulation of a useful phenomenological approximation. This approach 
has been used at least once, by Flynn et al.[7], and it could be incorporated into the 
present work. Second, it should be possible to calculate at least approximately what 
the shape of the blm- would be by considering multispectral diffraction in the appropriate 
bandwidth. Combining these methods would have the advantage of providing a comparison 
between theory and experimental results, thus offering the opportunity for a more complete 
understanding of what is going on in this crucial area. 

• Probably the most critical area of concern regarding the present model is the use of a point 
emitter approximation when the real emitters are truly extended. The extended sources 
will certainly reduce the effects of positional nonuniformity, especially at small blur sizes. 
It is also true that the fact of extended sources will significantly alter the shape of the rlur. 
Thus, this issue is intimately connected to the previous one. A method for deahng —1th 
it in the context of this model is readily apparent. It should be possible to use a series of 
point emitter approximations to represent one extended emitter. Such an approach wruld 
not only improve this model, but should shed significant light on the blur shape issue. 

• We got far enough on the rotational work to determine that it should be doable with our 
approach. We have a high degree of confidence that with more time the model can be ex- 
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tended to include this effect. We believe it is an important issue because the nonuniformity 
will be affected and KHILS is planning to rotate the detector arrays during testing. Thus, 
it is important to know the degree to which the nonuniformity will be affected £ind which 
parameters are important. 

• Although a lot was accomplished with regard to spatial droop, it would be nice to model 
how spatial droop occurs and then subtract it from the predicted nonuniformity. Being able 
to do this would allow low-frequency spatial droop to be removed from the measmement 
of emitter array nonuniformity. This task is not expected to be difficult. 

• We have been told that a good model for nonuniformity would be useful to KHILS as it 
deals with potential customers. If this is true, it might be useful to have a professional¬ 
looking program and interface that would calculate and display the results of the model 
for particular input parameters. Currently eight different inputs are required, so it is 
impossible to provide a plot which covers every possible situation. The program is not too 
difficult to use as it stands; however, the output is a matrix of raw numbers which must 
be put into another application to produce useful graphs. This situation is likely to limit 
the use of the program. 

Much has been accomplished in the development of a useful model which will predict the non- 
uniformin- of a test article’s output image. Although there is much more that can be done, 
hopefully the present work will prove to be a useful tool. 
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1 Objective 

The objective in this effort was to implement a optimization algorithm for multiple 
Dallenbach layers based upon a genetic algorithm to provide broadband RCS reduction 
from a planar metal surface. Each Dallenbach layer has a certain thickness, d, and material 
properties of permittivity, e, and permeability, //. The layers are placed on the planar metal 
surface and the parameters are chosen so as to minimize some quantitative measure of 
the RCS versus frequency and incidence angle. A specific genetic algorithm has been 
implemented to minimize a RCS cost function with respect to the input parameters. This 
report provides a description of Dallenbach layers and the appropriate electromagnetic 
theory along with a brief description of the genetic algorithm. 


2 Technical Approach 

2.1 Introduction 

The techmcal approach used to solve this problem was based upon a binary genetic 
algorithm that encoded the Dallenbach layer thickness and material properties in an ap¬ 
propriate chromosome. These individual layer chromosomes were then combined into 
one large chromosome, which was subsequently used to evaluate the cost function for 
the next iteration. Convergence was assumed when the cost function varied within a 
prescribe tolerance between subsequent iterations. The technical approach used in this 
project was based heavily on a method previously published in the literature [1], and 
intended to enhance this method by using a different cost function. This complete ap¬ 
proach would then be suitable to include in a larger multidisciplinary design optimization 
(MDO) program for optimizing a flight vehicle for structures, aerodynamics and Radar 
Cross Section (RCS). This algorithm would provide the RCS optimization of the flight ve¬ 
hicle, given that the small surface patches could be assumed to be locally planar. 

The progress to date since contract award includes an operational C++ genetic algo¬ 
rithm code which can be used for optimization of radar absorbing material layers. The 
code is based upon reflection from a stratified media placed over a perfectly conducting 
backing (i.e. metal). The genetic algorithm used is a binary genetic algorithm which codes 
the material type and thickness for each layer in binary numbers. Various sections of the 
code have been debugged and tested and are working properly. The code has been bench- 
marked against analytical solutions for several two and three-layer Dallenbach layer con¬ 
figurations at different frequencies. The code was also benchmarked against previously 
published results in the open literature [1], but several discrepancies were noticed be¬ 
tween the previously published results, analytical solutions and the algorithm developed 
in this project. The previously published results differed from the analytical solutions and 
since our algorithm agreed with analytical solutions, it is assumed that the previously 
published results were either mislabeled or somehow incorrect. Future work to continue 
this project and genetic algorithm development include benchmarking the code against 
more complicated analytical solution configurations and determining the exact nature of 
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discrepancies with the previously published results. Extensive convergence studies also 
need to be performed to determine if a fitness scaling function is required to enhance the 
convergence of the genetic algorithm. Further enhancements and improvements to the 
genetic algorithm would also be in order. 

2.2 Theory 

A Dallenbach layer is a layer of dielectric material that is placed directly on the con¬ 
ducting surface as shown in Figures 2 and 3. The performance of the layer is analyzed 



Figure 1: Geometry for single Dallenbach layer for RCS reduction using perpendicular 
polarization. 

in terms of the reflection coefficient for both horizontal and vertical polarizations as a 
function of the permittivity and permeability of the material as defined by tr = e' — je" 
and Hr = n' — j h" > respectively. The thickness of the single Dallenbach layer also affects 
the reflection coefficient and it can be obtained from the zero reflection condition at the 
surface of the layer by the expression 


d = — tanh ^ 


(I) 


( 1 ) 


where 7 is the complex propagation constant in the material layer, Zq is the wave impedance 
of free space, Z is the characteristic wave impedance of the material and is given by 


Z = A'+jT 


( 2 ) 


where 




cos dr, 


( 3 ) 


2-3 






Figure 2: Geometry for single Dallenbach layer for RCS reduction using parallel polariza¬ 
tion. Note the difference in the electric and magnetic field vectors from Figure 2. 


y 

and j = and 5e, 5m are the electric and magnetic loss tangents, respectively. For 
a given Dallenbach layer thickness, equations (1), (3) and (4) can be used with an opti¬ 
mization algorithm to obtain the material parameters ej., /ij., e" and (since Sg and 5m are 
directly related to e" and /x"). Note that there are four degrees of freedom, but constraints 
can be placed on the variables to make the optimization a tractable problem. The single 
Dallenbach layer is a frequency sensitive method and v/orks best for normal incidence. 
It can be designed for off normal incidence if desired. Using multiple Dallenbach layers 
increases the absorption bandwidth, leading to broadband RCS reduction. 

For multiple Dallenbach layers and_^ parallel polarization [2], assume the geometry 
consists of M layers, each with a thickness, dm, and with material parameters Cm, fJ-m and 
(Tm- Further assume that the Mth (or last) layer is a semi-infinite layer with dju ^ oo so 
that no reflected waves exist in the last layer (M). The reflection coefficient, R\\, is given 
by the expression 


R - ^0 - 
" Ko + 


(5) 

where 


,7 _ ^m+l "I" -hm td'l^h.Umdm 

^m+l tanh Umdm 

(6) 

is the impedance of the m-th layer with 


L' 

Rm = -:- 

(7) 
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and 


“m = \/A:o + 7m 

The term is the free space wave vector defined by Ao = u}/c and the term ■'im is the 
complex propagation constant for material m defined by 


7m — 




JU)(7mk“m 


,2 


(9) 


In the case when the last layer is a conducting surface, the impedance of the final layer is 
given by 



Zm = 0 

(10) 

For the perpendicular polarization, the reflection coefficient is given by 



h- 

II 

+ 1 

(11) 

where 

Ym+1+Nmta.nhUmdm 
^Nm + Ym+lt^nhUmdm 

(12) 

is the admittance of the 

m-th layer with 



iV - 

(13) 


and Um as defined in (8). For the case in which the last layer is a conducting surface, the 
admittance of the final layer is given by 


Y^ji y oo 


(14) 


2.3 Description of Optimization Method 

The original plan in the statement of work for this project called for implementation 
of a projected gradient optimization algorithm for multiple Dallenbach layers to provide 
broadband RCS reduction from a planar metal surface. However, in a search of the liter¬ 
ature, a different approach was identified as a preferred optimization method. This opti¬ 
mization method is a genetic algorithm, which has already been used for various appli¬ 
cations in electromagnetics. The genetic algorithm was chosen over the project gradient 
method for several reasons. A genetic algorithm optimizes with continuous or discrete 
parameters, it does not require gradient information, it simultaneously searches from a 
wide sampling of the cost surface, it deals with a large number of parameters and is well 
suited for parallel computing, it optimizes parameters with complicated cost surfaces, it 
does not get stuck in a local minimum, it provides a list of optimum parameters, it can 
work with encoded parameters and, finally, it can work with numerical data, experimen¬ 
tal data or analytical functions [3]. A genetic algorithm has already been used to design 
lightweight, broad-band microwave absorbers suitable for reducing radar signatures [1] 
and this approach forms the basis for the work. Reference [1] provides a good description 
of the entire genetic algorithm, and some highlights of that approach will be briefly cited 
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here. 


This genetic algorithm will optimize multiple Dallenbach layers for minimum reflec¬ 
tion given a predefined set of frequency-dependent materials. The optimization will si¬ 
multaneously determine the optimal material choice for each layer and its thickness over 
a prescribed range of incidence angles and frequencies. An upper bound is placed on 
the number of layers and the thickness of each layer. The material for each layer is bi¬ 
nary coded in a sequence A/} of m bits and the thickness is binary coded in a sequence 
using t bits. The material choice for each layer, Lj = MjTj, includes the material and 
thickness bits. A chromosome, G, in the genetic algorithm is formed by combining the 
material choices for all layers asG = L 1 L 2 ...LN 1 . The flowchart for the genetic algorithm 
approach is shown in Figure 3. This flowchart represents a typical genetic algorithm with 



Figure 3: Flowchart for proposed genetic algorithm to optimize a series of Dallenbach 
layers over a planar metal surface for reduced radar signature. 

a selection phase, a crossover (or "mating") phase and a mutation phase. The details of 
each phase given in [1] are quite lengthy, and will be omitted here for the sake of brevity. 
However, a good description of the various components of this particular genetic algo¬ 
rithm are given in [1] and descriptions for more generic genetic algorithms are given in 


2-6 






The initial population, Pq, is filled by constructing random sequences of bits for each 
chromosome in Pq. The initial population is then sorted and any clones are replaced by 
random sequences until all initial sequences are unique. The top one-half of the initial 
population, Pq, then becomes the iterative (or working) population, P,. This population 
is subsequently used during the iterative portion of the optimization algorithm. 

The crossover (or mating) phase uses the top 1/2 of the iterative population of the 
genetic algorithm. Chromosomes are randomly paired and a random crossover bit (other 
than 0) is selected for mating. The child sequences are obtained from the parent sequences 
by combining the group of bits to the left of the crossover bit of the first parent with the 
group of bits to the right of the crossover bit of the second parent, and vice versa [?, 
michielssen93a] Any zero child sequences are replaced with a random sequence, and any 
clones are replaced with random sequences to add further versatility to the optimization 
procedure. 

At the present time, a mutation phase has not been implemented in the optimization 
procedure. It is, however, a simple matter to add this feature to the current optimiza¬ 
tion algorithm. As mentioned in [1], a fitness scaling function was employed so that the 
genetic algorithm would not lead to premature convergence due to a few highly fit se¬ 
quences in the initial population, or from poor convergence due to a large number of fit, 
but very similar, sequences. This fitness scaling function was also not implemented in the 
present algorithm, but could be implemented in the future after extensive convergence 
studies have been performed. Convergence is achieved when the difference in the mean 
of the cost function from one iteration to the next is less than a prescribed tolerance level. 


The cost function used in this optimization procedure is based upon the reflection 
coefficient. The genetic algorithm optimizes a cost function related to the reflection co¬ 
efficient over several incidence angles (9) and frequencies (/) of interest. The actual cost 
function is given by 


/ — TnaXi^j 




(15) 


where the i and j denote individual incidence angles and frequencies, and R is the reflec¬ 
tion coefficient for both parallel (||) and perpendicular (J_) polarizations. The cost function 
is averaged over all incidence angles and frequencies to determine the mean cost, which 
is the final value of the cost function used for optimization. 


2.4 Results 

To benchmark the genetic algorithm code against analytical solutions, several simple 
Dallenbach layer problems were constructed for the parallel polarization to verify the 
cost function calculation portion of the optimization procedure. The first case is a single 
Dallenbach layer over a PEC backing. The material layer is 2 mm thick, with material 
properties e, = 10 + jO, = T and the frequency is 1 GHz. The analytical reflection 
coefficient is yexact = 0.996 — j0.084 and the genetic algorithm code produces the result of 
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1 

2 

3 


layer 1 

14,0.6 

16,1.0 

15,0.4 

11 ,0.8 

layer 2 

oo 

o 

bo 

13,0.4 

14,1.8 


layer 3 

6 ,1.0 

13,0.6 

11 ,1.2 

8 ,0.4 


Table 1: Output designs for 3-layer optimization problem from 0.1-10 GHz. Values shown 
are material numbers corresponding to those in [1] and thickness expressed in mm. 


Tga — 0.996456 — jO.0841199, which is excellent agreement. 

Next, a two-layer Dallenbach scattering problem was set up with parameters: ti = 
t2 = 2 mm, €ri = 10-1- jO, €r 2 = 50-1- jO, = Mr2 = 1-0 + jO, and the frequency was again 
1 GHz. The analytical result is Fexact = 0.985 — j0.172 and the compDesign uted result is 
Tja = 0.98502 -jO. 172442. 

For a three-layer Dallenbach scattering problem, the parameters were: ti = t 2 = = 2 

mm, Cri = €r3 = 10-1- jO, Cr2 = 15 4 - jO, }iri = IJ-t2 = = 1-0 + jO, and the frequency was 

1 GHz. The analytical result is Te^act = 0.96489 — j‘0.2626495 and the computed result is 
Tja = 0.964861 — jO.262761. If we change material 2 to 6^2 = 50 -f jO, the results are then 
Fexact = 0.96075 - iO.27742 and Fga = 0.960713 - jO.277545. These Dallenbach scattering 
problems therefore show that the cost function is being evaluated correctly for various 
types of input situations involving lossless dielectric materials. 

To further calibrate the genetic algorithm code and the associated reflection coefficient 
calculations, several optimal designs presented in [1] were input to the genetic algorithm 
code to test the accuracy of the reflection coefficient calculations. The designs tested were 
the LFl, LF2, LF3 and LF4, and HFl, HF2, HF3 and HF4. These results disagreed with 
the graphical plots presented in [1]. A separate computer program written in FORTRAN 
was developed to provide the analytical reflection coefficient for a stratified media based 
upon the theory presented in this report for the parallel polarization. The LFl-4 and HFl- 
4 designs were input to this code, which provide the same results as the C+-t- genetic 
algorithm code. This independent verification indicates a possible problem with the pub¬ 
lished results. However, due to time considerations, the exact cause of this discrepancy 
was not investigated. 

To demonstrate the optimization algorithm, two sample problems were executed. The 
first problem is a three-layer optimization problem over the frequency range 0.1-10 GHz, 
normal incidence, with a maximum individual layer thickness of 2 mm. An initial pop¬ 
ulation of 8 chromosomes was used, which results in an iterative population of 4 chro¬ 
mosomes. Each chromosome was randomly initialized, and the genetic algorithm was 
applied to optimize the design. The output designs are shown in Table 1. The analyti¬ 
cal reflection coefficient results for these designs are shown in Figure 4 at the end of this 
report. The graphs show that indeed the Design #1 shows the best mean, or average. 
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1 

2 



layer 1 

14,1.0 

3,1.4 


Esai 


12 ,1.2 

2 ,1.0 

16,1.6 

5,1.8 

layer 3 

11 ,0.6 

16,0.8 

13,1.0 

4,1.2 

layer 4 

9,0.8 

15,0.4 

13,1.2 

BEII 

layer 5 

8 ,0.2 

13,0.4 

12 ,0.6 

1 ,0.8 


Table 2: Output designs for 5-layer optimization problem from 0.1-10 GHz. 


reflection coefficient over the entire frequency band, and it was chosen as the optimal de¬ 
sign. 

The next sample problem was the same, except the number of layers was increased 
to 5. The results for this case are shown in Table 2. The reflection coefficient results for 
each design in this problem are graphed in Figure 5. Note that although Design 4 shows 
a large null in the reflection coefficient, it appears that Design 1 has the lowest mean re¬ 
flection coefficient over the frequency band. The difference between the final mean cost 
functions between Designs 1 and 4 is approximately 8 dB. 

A final, larger problem was run which included an initial population of 64 chromo¬ 
somes, an iterative population of 32 chromosomes, 100 iterations, 5 layers, 10 incidence 
angles from 0 — 90°, and the same frequency band of 0.1-10 GHz. The first two output 
design reflection coefficient results for normal incidence are shown in Figure 6. Note that 
Design 1 offers the lowest mean reflection coefficient over the frequency band of interest. 


3 Conclusions 

The goal of this project to create an optimization algorithm for optimization of radar 
absorbing material layers has been achieved. The algorithm is based upon a binary ge¬ 
netic algorithm, which optimizes a reflection coefficient cost function over a range of fre¬ 
quencies and incidence angles of interest. The code was verified against analytical solu¬ 
tions and attempts were made to verify the code with previously published results in this 
area. The algorithm was demonstrated on various optimization problems to illustrate 
its operation. Future work for this project would include extension to the perpendicu¬ 
lar polarization, enhancements to the genetic algorithm via improvements to weight the 
cost function with a measure of layer thickness or overall thickness, further validation 
and verification efforts and further improvements in mating/crossover procedures and 
algorithm convergence studies. 
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1 Problem Description 

In an Automatic Target Recognition system, vehicles are classified as belonging to either a 
target class or a confuser class (the set of non-target vehicles). Typically, this classification is 
achieved by selecting a member of the target class, and then creating a set of templates based 
on the images of the selected vehicle. The set of templates are used to represent the entire 
target class. 

The classification decision is determined by the Mean Square Error (MSE) distance from 
an unknown image to the set of templates. A small distance indicates a close match to the 
templates, and therefore the unknown image is classified as belonging to the target class. 
Otherwise, the image is determined to belong to the confuser class. 

One problem with this method arises when classifying certain classes of vehicles, such as the 
T72 class. Some members of the target class are classified correctly, while others are classified 
as confusers. The performance depends heavily on the vehicle of the target class that is chosen 
to create the templates. If there is a large degree of dissimilarity among the vehicles in the 
target class, then more than one set of templates is required to accurately represent the entire 
class. A solution is to design class models that together represent the entire target class. Such 
models are created by combining images from several different vehicles in the class. 

One of the issues when building class models is determining the number of sets of templates 
to be used for a given target class. Another issue is selecting which images of the target class 
to use to create the sets of templates. This thesis presents methods for handling these issues 
and building class models. 

2 A Pattern Recognition Problem 

The goal of air-to-ground Automatic Target Recognition (ATR) systems is to detect potential 
targets in an environment with clutter in the background and then to determine whether 
the object in question is in-fact a desired target. An ATR system requires several steps to 
accomplish this task, and uses several components. An introduction to ATR systems can be 
found in [?]. This thesis has been placed in the context of ATR, where the sensor data are 
two-dimensional images from a synthetic-aperture radar (SAR). 

The techniques that have been presented are aimed at discriminating between target vehi¬ 
cles and other confusing vehicles. The context of our work is the recognition problem when the 
target image has already been extracted from the environmental ground clutter surrounding 
it. The particular target class vehicles that were used for testing included a set of eleven T72 
tanks. A complete description of the data set is given in appendix ??. 

3 Template-Based Classifiers 

The problem described in section 2 is essentially a two-class discrimination problem, where 
the vehicles belong to either a target-class or a confuser-class. A classifier must be designed 
to determine which class a given SAR image belongs to. One way of achieving this is to train 
the classifier with the images of a member of the target class. Test images from all the other 
vehicles are then classified based on how closely they match the classifier templates. 
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The SAR images of vehicles that are used for both training and testing are also known as 
chips. Each image chip is a two-dimensional, 128 by 128 gray-scale image, representing the 
vehicle at a specific aspect angle (0-359 degrees). The depression angle, with respect to the 
sensor collecting the SAR data, is 17 degrees for all the data used in this study. 

The classifier is implemented as a set of templates, where each template represents a section 
of the vehicle used for training. Each template represents a five-degree section of the vehicle, 
and is created from the images of the vehicle included in the five degree angular window. Since 
each section is a five-degree window, seventy-two templates are needed to represent the entire 
three-hundred and sixty degrees of a training vehicle. 

To build the individual templates, the image chips of the vehicle that fall within each five- 
degree window are collected. For example, the ten-degree template of a classifier is formed 
by collecting all image chips of the training vehicle representing aspect angles 7.5 degrees to 
12.5 degrees (10±2.5). The image chips are aligned, and then averaged together to form the 
template for that section. 

3.1 Classifying images 

With template-based classifiers, the distance from a template to the unknown image is used 
for classification. The test image chip must be compared to each of the seventy-two templates, 
since the aspect angle of the test image is unknown. The distance to the closest matching 
template is used for classification. The program used to compute these template-chip distances 
is the “bright-region MSE classifier” (brmse), a program written by Ron Dilsavor of Sverdrup 
Technology. It implements Sandia’s bright region MSE classifier design. 

The performance of a classifier can be measured by how well it discriminates members of 
the target class from members of the confuser class. The target class used in this thesis is a 
set of T72 vehicles, and the confuser vehicles are all non-T72 vehicles. The template vehicle, 
or training vehicle, will be selected from the set of target vehicles. 

Two types of distances are collected to test performance: Distances from the template to 
the target chips and the distances from the template to the confuser chips. Figure 1 shows how 
these two collections of distances are distributed, using T72 model number A32 templates as 
an example. The solid-line is a histogram representing the distribution of distances between 
image chips from the entire class of T72 vehicles and the closest matching A32 template. In 
contrast, the dashed-line histogram shows the distribution of chips from the set of confuser 
vehicles, again using each chip’s distance to the closest matching A32 template. It is the 
overlapping portion of the histograms, in this case approximately .1 to 1, that makes the 
classification decision difficult. The more that these two distributions are separated, the easier 
the classification. The following section introduces a way to compare the relative performance 
of two or more templates, in terms of their ability to separate these distributions. 

3.1.1 ROC curves 

A standard way of measuring the performance of a template classifier is in the form of a Receiver 
Operating Characteristics curve, or a ROC curve. A ROC curve is a plot of the probability 
of detection (PD) versus the probability of false alarm (PFA). The PD is the probability of 
classifying a vehicle as a target, given that it is a target. The PFA is the probability of 
classifying a vehicle as a target, given that it is a confuser. 
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Figure 1; Distribution of template-to-chip distances 
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Figure 2: ROC curves 


The ROC curve is generated using the two distributions of distances previously described, 
and demonstrated in Figure 1. The PD coordinate is determined by the number of chips in the 
target class with a distance less than some D, divided by the total number of target chips. As 
the distance D increases, more chips are included in the count, and the PD increases. These 
points are plotted against the PFA coordinate, which is the number of chips in the confuser 
class with a distance less than the D, divided by the total number of confuser chips. As more 
confuser chips are included in the count, by the increasing distance D, the PFA also increases. 

Figure 2 demonstrates a comparison of two ROC curves. The target class is all eleven T72 
vehicles, and the confuser claiss contains three BMP2 variants, the BTR70, the BTR60, and a 
T62. A detailed listing of the vehicles used in the comparisons can be found in appendix ??. 
The solid-line ROC curve shows the result of using T72 model number A32 templates as the 
classifier, and the dashed-line ROC curve shows the result of using T72 model A64 as the 
classifier. Figure 2 indicates that an A64 discriminates the class of T72s better than an A32 
vehicle, since at any point the probability of detection is higher for a given probability of false 
alarm. 

In all of the experiments presented in this thesis, the vehicle used for training is excluded 
from the target class. For example, with the A64 classifier in Figure 2, the distance from the 
A64 template to the A64 image chips was withheld when generating the ROC curve. The same 
was done with the A32 ROC curve and the A32 template-chip distance. Only the template 
distances to the other ten vehicles were compared to the confuser chip distances. This is done 
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so that a comparison does not reflect a bias when a vehicle used for training is the same vehicle 
the classifier is tested on. 

4 Effects of Intra-Class Variability 

The accuracy of the classification depends on the vehicle chosen for forming templates, since 
that is essentially the vehicle to which a test image is compared. The problem is that a class 
of vehicles may consist of members that are very different from the other members of the 
class. This variation within a class makes it difficult for a classifier to discriminate target class 
vehicles firom confuser class vehicles. 

The reason is that a classifier designed with any one member of the class may perform very 
well for some members and very poorly for other members. For example, a classifier designed 
with a T72 model number A62 performs well when tested with a model number A63, but not 
as well when tested with an AOS. The vehicles that are most similar to the training data can 
be classified more accurately. If a class has few vehicles that are similar, or has a high degree 
of variability, then the classification performance for any unknown member of the class will be 
unpredictable. 

The eleven T72 vehicles are a class with a large degree of variability. Figure 3 demonstrates 
this, with a comparison of several ROC curves. Each curve shows the performance of particular 
T72 templates when discriminating the class of T72 targets from the confuser vehicles. It is 
evident from the plot that the performance of the classifier varies significantly depending on 
which T72 is used to create the templates. For example, although model number AlO performs 
relatively well, model number A32 does poorly when discriminating targets from this large set 
of confusers. The class of T72 vehicles illustrates the problem of intra-class variability when 
designing template-based classifiers. 

5 Goals of This Project 

The objective of this research project was to design algorithms for constructing models of 
classes of vehicels and to determine the effectiveness of such models for Automatic Target 
Recognition (ATR) Systems. A class of vehicles, such as the T-72s, has large variation among 
all the members of this class. Due to this variability, an ATR system trained on one member 
of the class may not be able to recognize all the other members as belonging to its own class. 
It is desirable, therefore, to design models of vehicle clEisses that deemphasize the intra-class 
variabilities while retaining enough information to discriminate against non-members of the 
vehicle class. 

Section 8 introduces a measure for determining the theoretical compactability of a class of 
vehicles into a smaller number of representative models. This measure can help in choosing an 
appropriate number of models, with the corresponding expected error, for a class of vehicles. 
Section 9 presents a simple clustering algorithm to select the vehicles to be merged together 
while forming templates for the combined representative models. The approach in Section 10 
incorporates the aspect-angle dependent variability of the class in the model building process, 
and thus provides more flexibility to the selection of images for constructing class models. In 
Section 11, the experimental results are shown for classifiers designed usingboth the above 
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ROC plots for discriminating T72s from Confusers 

Figure 3: Intra-class variability of T72 vehicles 
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class models. We show significant improvement in performance by these class models. 


6 Related Work 

The difficulty of separating vehicles in the same class, especially the T72 class, is demonstrated 
and explored in [WPB97] using template-based classifiers. One approach taken was to train 
a classifier with templates that are created by averaging the templates of all vehicles in the 
target class. Improvement was shown for a class of three T72 variants. The other techniques 
developed in [WPB97] involved normalizing and thresholding the images before the templates 
were formed. In this thesis, the benefits of mixing templates is further examined on a much 
larger set of T72s, where the focus is on creatively selecting images of the different vehicles 
to be used in template formation. Also, in chapter ?? a thresholding scheme is shown which 
processes images after template formation. 

In [SKW''98], a classifier is designed with models using MSTAR Predict, to compensate for 
various target poses and intra-class target variability. MSTAR Predict produces a simulated 
radar image from a CAD model, which can be used to classify images according to the vehicle 
modeled. The paper shows that these models are still susceptible to the problem of intra-class 
variability, which is “generally more difficult” than handling target pose in Predict models 
[SKW+98]. Results are mentioned with improved performance by including information about 
the intra-class variability of T72 models. This thesis complements their work be developing 
techniques to extract information about the T72 class variability for better classification. Also, 
our classifiers are designed using actual measured SAR images, and not predicted signatures. 
Although [SKW+98] demonstrates the use of the MSB metric with MSTAR Predict models, 
no steps are taken in this thesis to show how our results apply to these models. 

A detailed study of intra-class variability is given in [BDMP98]. The performance of tem¬ 
plates created from combining images is tested, using a few types of vehicles from the T72 class 
The intention of this thesis is to extend the work on combining images, providing techniques 
to determine how many and which images to combine. [BDMP98] also performs tests using 
signature prediction with DEMACO’s XPATCH system, modeling a specific T72 vehicle. The 
classifiers in this thesis were only trained on measured SAR data. 

7 Class Models 

The intra-class variability of target vehicles degrades the performance of a classifier. The ap¬ 
proach taken in this thesis is to create class models, or a number of subgroup representative 
classifiers that together represent the entire class of T72s. This improves the overall perfor¬ 
mance when discriminating such classes against confusers. A class model is based on a set of 
data to train the classifier, that is meant to represent an abstract type of vehicle in the class. 
They are created using images from several different tanks in the class. 

Using the T72 class as an example, one such class model may be designed to represent 
vehicles A04, AOS, A07, and the AlO, assuming these vehicles belong to the same abstract 
type of T72. If there are four subgroup models used to represent the entire class, then all four 
models would be required to adequately discriminate all target vehicles. This is considerably 
less computation than using each of the eleven individual vehicles to discriminate the targets 
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from confusers. Also, since information about each vehicle is incorporated into the models, 
they should perform better than any individual vehicle in the class. 

7.1 Class models with templates 

Since template-based classifiers are being used, a class model will be represented by a set of 
templates created using measured SAR images of vehicles. Bach template will be formed by 
averaging a list of images from (potentially) several different vehicles. If this is done for each 
aspect angle, then the performance of the model can be tested in the same manner as templates 
that were created from a single vehicle. 

Two important issues arise when designing such class models. The first issue is determining 
the number of models to be used for a given class. This will depend on the degree of variability 
inside the class. The second issue is deciding which particular images will be used to form the 
templates of a certain model. 

8 Class Compact ability 

A first step in building class models is determining how many models should be used to 
represent a single class. This number will depend on the degree of variability within the class. 
A class with high variability would require more models to represent it than a class with less 
variability. For example, a class of vehicles may consist of many similar members, or it may 
have several very different members with only few being relatively similar to each other. Only 
similar vehicles could be represented accurately by the same model. A notion of compactability 
is introduced as a guide for determining the number of models to be used to represent the class. 

8.1 A Measure of Compactability 

Informally, the compactability of a class indicates the extent to which it can be represented 
by fewer number of models without losing information required to classify individual members 
of the class. If a class has a high degree of compactability, than the members of the class 
are relatively similar to each other. A class with lower compactability measure would indicate 
members that are relatively dissimilar. 

The N known members of a class can be represented as N points in a distance space. A 
point has a location in this distance space defined by its distance from every other member in 
the clciss, therefore, resulting in an A - 1 dimensional space. As the number of dimensions 
used to represent the N members is decreased to N — 2, N — 3,... ,1, information will be lost 
and some error will be introduced. The dimension m, such that if N was reduced to m - 1, 
then the amount of information lost increases significantly, determines the compactability of 
the class. 

For the class of T72 vehicles, each point in the space is a member or vehicle. The distance 
between any two points is the MSB distance between the images. As we represent the points 
of the space in fewer and fewer dimensions, we are assuming fewer types of vehicles required to 
represent the class. The structure of the class is shown by the distances between the vehicles. 
The compactability testing is a meeisure to determine how many of the vehicles are similar 
enough to be represented by the same class model. 
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Table 1: Table of vehicle distances 



132 

812 

S7 .. 

A64 

132 

0 

.43839524 

.40990450 

.45779017 

812 

.43839524 

0 

.43240280 

.44535691 

S7 

.40990450 

.43240280 

0 

.43279583 

A64 

.45779017 

.44535691 

.43279583 .. 
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8.2 Distance Metric 

To find the distance between two vehicles, the average MSE distance between an image and the 
nearest template from the other vehicle is computed. This is repeated for all pairs of vehicles, 
so that a distance is obtained from each vehicle to every other vehicle in the class. Table 1 
shows a sample table. 

In the case of the class of T72s, there are eleven vehicles. The eleven vehicles can be treated 
as points and placed in a ten-dimensional space, preserving the pairwise distances discovered 
in the previous step. 

The next step for computing the compactability is compressing the space. The points 
for the eleven T72 vehicles are compressed into successively smaller number of dimensions, 
preserving the pairwise MSE distances as much as possible. For example, the space will be 
reduced from ten to nine dimensions, then from ten to eight dimensions, continuing until finally 
ten dimensions is reduced to one dimension. 

8.2.1 Algorithm for Compaction 

The amount of error introduced by compressing the space has been measured in two ways. It 
has been measured in terms of the average MSE between the distances in the original space and 
the new distances in the compressed space, which is calculated during the mapping previously 
described. It has also been measured in terms of the reversed MSE distance pairs in the 
compressed space. The method used for calculating this second measure is shown in Figure 4. 

This algorithm measures how many distance pairs had smaller MSE distances than other 
pairs in the original space but have larger distances in the compressed space. Starting with a 
distance Dj in the compressed space, its position or rank in the original space is found. Each 
distance Dj that is below Di in the original space is tested to see if it is still below Di in the 
compressed space. If the difference between the distances Di and Dj is less than one percent, 
then the two distances are similar enough not to consider reversed. Otherwise, the relationship 
between the two points has not been preserved and the pairs are considered to be reversed. 

As the number of dimensions decreased, the MSE of the new distances between points 
continued to increase. The rank order, as shown in Figure 5, did generally increase as the 
space was compressed, but not in every case. 

The compactability results indicate that relatively more information about the structure 
of the class is being lost after the sixth and second dimension. At a low number of dimensions 
such as two, a large degree of error is introduced in terms of both MSE and reversed distance 
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Figure 4: Measuring reversed MSE distance pairs 

Given a table Ta of MSE distance pairs in the original space, and a table Tb of MSE distance 
pairs in the compressed space, 

• Sort Ta and Tt in non-decreasing order, creating ordera and orden respectively. 

• For each distance Di in orderb' 

1. Find index k of Di in order a 

2. For every distance Dj in ordera with an index greater than k: 

Find index I of Dj in orderb 
if I is less than i, then 
Compute the difference between Di and Dj 
If the difference is greater than 1 percent, then 
increment out — of — order by 1 


oulofordwll 



Figure 5: Impact of compressing space on rank order of distances 
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Figure 6; Clustering Algorithm 


1. Create dissimilarity coefficient matrix 

2. Sort rows of dissimilarity coefficient matrix in non-decreasing order by distance 

3. Initialize all points as belonging to their own cluster (p clusters) 

4. Scan sorted dissimilarity coefficient matrix, starting with the shortest distance. 

5. If the two points with that distance belong to different clusters, then merge the two 
clusters they belong to. 

6. While the number of clusters is greater than k, repeat to step 4 


pairs. This is evidence for using six class models to represent the class of T72 vehicles. 

9 Clustering 

Compactability gives some insight into how many models can be used to represent a cleiss. 
Once the number of models that will be used to represent a class is chosen, it is another 
matter to decide exactly which vehicles actually form the different types to be modeled. 

One approach is to have each class model represent an exclusive subset of the vehicles in 
the class. In this way, the four models together could discriminate the class of vehicles. An 
unknown image would only need to be compared to each of the four models to decide if it 
belonged to the target class. Clustering can be used to decide which subset of vehicles each 
class model will represent. 

To find a subset that only contains one type of vehicle, the vehicles should be grouped 
into sets of similar vehicles. The MSE distance between vehicle images can be used as a 
measure of similarity. If there are eleven vehicles, then each vehicle can be represented by a 
point in a ten dimensional space. The distance between any two points is the MSE distance 
between the corresponding vehicle images. The subsets of vehicles can be found by an exclusive 
classification of these points. The classification is exclusive to achieve disjoint partitions of the 
space (vehicles). This can be accomplished through clustering, where the number of clusters 
to be found will be the number of models to be used for that class. 

9.1 Clustering Algorithm 

The vehicles will be clustered according to their mutual MSE distances (the same distances 
that were used during compactability testing). Since the objective is to find points that are 
nearest to each other, a nearest-neighbor algorithm applies naturally. The steps taken in the 
algorithm are shown in Figure 6, where p is the number of points in the space and k is the 
number of clusters to be found. 
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Figure 7: Clusters found for eleven vehicles 


9.2 Vehicle Clusters 

The algorithm was applied to the T72 class of vehicles, represented by eleven points in an 
eleven-dimensional space. The number of clusters to be found was set to six, as was indicated 
by the compactability results. Figure 7 shows the resulting clusters. The axes represent the 
coordinates of the points after being projected into a two-dimensional space. 

One large cluster was found, consisting of the A04, AOS, A07, AlO, and A64 vehicles 
(labeled as an ‘x’ cluster). Another smaller cluster of two vehicles, A62 and A63, was found 
(labeled as an ‘o’ cluster). The other four T72 vehicles remained members of their own clusters, 
and were never merged with other vehicle clusters. 

This implies that a large portion of the T72 class (five of the eleven) represent the same 
type of T72, and a second type of T72 is made up of the smaller ‘o’ cluster. The other vehicles 
have no vehicle in the class similar enough to be considered as representing the same type. In 
this way, those four vehicles (132, 812, S7, A32) are their own type of T72. 

To build the class models, the same approach that is taken when forming templates for a 
classifier created from a single vehicle is taken when assembling templates for a model with 
images of more than one vehicle. The class model will consist of seventy-two templates covering 
five-degrees each. For each template, all the image chips falling within the degree range are 
collected firom all the vehicles in the cluster, instead of from a single vehicle. 
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10 Aspect-Dependent Clustering 

The previous approach clustered the images in a vehicle space, meaning they only allowed 
images to be considered as part of a particular vehicle. If this restriction is relaxed, we could 
look at the similarities in the vehicles across aspect angles. In this way, the image chips would 
belong to a degree-range group instead of a T72 group. For example, the image chips in the 
zero degree range (±2.5) from all different T72s can be clustered to form groups of T72 images 
that are similar from this aspect angle. If models are built based on the angle specific clusters, 
then it may be a more accurate representation of the class. 

Another restriction of the vehicle-space clustering is using a fixed number of clusters when 
creating the templates used in a model. This technique does not reflect the fact that there is 
more variability among the images of certain aspect angles than others. For example, the front 
of a tank contains much more detail than the rear of a tank, and such detail can be exploited 
by a well designed classifier. 

Clustering will be used to find similar image chips in the chip-space. As before, the clus¬ 
tering will determine which images are used to construct each class model. When clustering 
in the vehicle-space, the distances between vehicles had already been defined for use in com¬ 
parisons. The corresponding comparison is not defined in the chip-space. In order to calculate 
the clusters of image chips, a notion of mutual distances between all image chips needs to be 
formulated. The MSB distance for each chip to a common template was used. 

10.1 Clustering algorithm 

Figure 8 describes the algorithm used to cluster the images using a distance between chips 
in the same degree range. Step one initializes a table of image chips versus their distance to 
each T72. There are as many columns as there are vehicles in the class, and as many rows as 
there are chips in the degree range. The table is then iteratively searched to find the minimum 
values. If two minimum values are in the same column, that indicates the chips of those rows 
are nearest to the same vehicle, so their clusters can be merged. If two minimum values are 
found in the same row, then the chip of that row has already selected a vehicle that it is closest 
to, so the chip should not be clustered based on the second minimum value. The number of 
clusters is continually reduced by merging, until a desired number of clusters is found. With the 
T72 data, the number of clusters used was four. After all chips are clustered in the first degree 
range, a new table is initialized for the next degree range and four new clusters are formed 
from the new list of chips. This continues until all aspect angles (spanning five degrees) are 
clustered, each containing four groups. 

10.2 Variable Number of Clusters 

The next step is to accommodate for a variable number of clusters for each degree range. There 
will need to be some restrictions placed on the data, such as the minimum size of a cluster and 
the maximum number of clusters. Also, the number of clusters should reflect the similarity of 
the images at each aspect angle. These requirements were met by running the above clustering 
algorithm in two passes. 

During the first pass the chips are clustered into the maximum number of clusters that is to 
be allowed. For this particular data. The image chips were clustered into four groups. After the 
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Figure 8: Clustering Algorithm 

1. A given chip is compared to eleven templates - one from each of the T72s. This is 
repeated for all chips in the aspect angle window. Each chip is a row of a table, where 
the columns are the templates. 

2. Find the minimum value in this table and mark it. 

3. Find the next minimum value, mi, in this table. 

4. • If there is a marked value in the same row as mi, do nothing. 

• Else if there is a marked value in the same column as mi, then cluster those chips 
(rows) together and mark mi. 

• Else just mark mi. 

5. Repeat to step 3, until all chips are clustered. 

6. Repeat process (from step one) for all seventy-two degree ranges. 


first pass, the cluster information can be processed. The information that will be used is the 
number of chips that belong to each of the clusters found. For each cluster with less than some 
threshold number of chips, a Reduce - by value will be incremented. In this case a threshold of 
three was used, so that clusters with two or less chips would increment Reduce - by. After all 
the clusters have been inspected, the Reduce — by total will be subtracted from the maximum 
number of clusters used in the first pass. This calculated value will be the new number of 
clusters to be found. 

The second pass will start from scratch, re-clustering the entire set of image chips with the 
same algorithm as before. But during this pass, the number of clusters to be found will have 
been adjusted by the results of the first pass. 

These two passes will be applied to each of the seventy-two angle ranges, achieving the 
variable cluster numbers at each range. Figure 9 is a plot of the clusters found for the T72 
data set. There was more variability among chip images at the points where more clusters 
were required, such as around the zero degree range. 

10.3 Creating Models 

The class models are created by building templates for each degree range, as was done before. 
There will be as few as one template for a degree range, or as many as the maximum number 
of clusters allowed, depending on the individual aspect angle variability. Each template will 
represent a sub-class of the vehicles in the class, at a specific aspect angle. 

The list of chips that will be merged to form a template at a certain degree will be precisely 
the list of chips belonging to the cluster for that degree. The same program that was used 
before to create templates is applied to these new lists of chips. 
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Figure 9: Variability of each aspect angle 

11 Performance of Class Models 

11.1 First clustering approach 

This section presents the results of clustering in the vehicle-space. The performance of a class 
model designed using vehicles from a cluster is compared to the performance of classifiers 
trained on the individual vehicles of that cluster. 

The ROC curves in Figure 10 compare classifier performance using six confuser vehicles. 
clusterl is the class model created using vehicles from the large ‘x’ cluster of Figure 7. The 
other five templates demonstrate the performance of classifiers trained with the five vehicles of 
that cluster. The target class used in this experiment is also the five T72 cluster vehicles. The 
figure shows that the class model performed better than any individual vehicle of its cluster. 
Another experiment tested the performance of a class model designed using vehicles from the 
smaller cluster (A62 and A63). As before, the class model classifier out-performed classifiers 
trained on the individual members of the class. 

11.2 Second clustering approach 

This section presents the experimental results of a class model designed when clustering in the 
chip-space. The mix template is the class model classifier. Figure 11 shows ROC curves with 
a target class of all T72s and the entire confuser class. As the figure demonstrates, the class 
model performs better than any individual training vehicle, especially at the upper part of the 
curve where the probability of detection is higher. 

The ROC curves in Figure 12 compare the two class model approaches, clusterl being the 
vehicle-space model and mixed begin the chip-space model. The test is comparing with targets 
clusterl was designed to classify, which are named again in the figure. The results show that 
clusterl performs better, although not by a large margin. 
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Figure 10: Class model from the vehicle-space {clusterl) 



Figure 11: Class model from the chip-space {mix) 
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Figure 12: Chip-space model (mix) versus vehicle-space model {clusterl) 
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VALIDATION OF A LARGE EDDY SIMULATION CODE 
AND DEVELOPMENT OF COMMUTING FILTERS 


G. A. Blaisdell 
Associate Professor 

School of Aeronautics and Astronautics 
Purdue University 


Abstract 

A large-eddy simulation (LES) code developed at the Air Force Research Lab is applied to the problem 
of decaying compressible isotropic turbulence in order to help validate it. The code uses a high-order 
compact finite difference scheme coupled with a high-order compact spatial filter. Comparisons are made 
with previous direct numerical simulation (DNS) and LES results from a Fourier spectral code. For 
the DNS test case the finite difference code is found to give excellent agreement with the results of the 
spectral code. LES of a low Reynolds number case show that the finite difference code run on a 48^ grid 
gives results similar to those of the spectral code run on a 32® grid. The results from the finite difference 
code run on a 32® grid for both the low Reynolds number test case and a high Reynolds number test case 
show some differences compared to the results of the spectral code. It is believed that the discrepancy 
is due to the solution filtering procedure in the finite difference code, although further simulations are 
needed to test this hypothesis. The issue of the commutation of filtering and differentiation is considered. 
The filters recently proposed by Vasilyev are found to produce commutation errors that are less than the 
truncation error of the finite difference scheme used. Several suggestions are made for ways to complete 
and extend the current work. Extensions include a detailed examination of the effect of solution filtering 
and its relationship to filtering as it is viewed in classical LES. 


4-2 



VALIDATION OF A LARGE EDDY SIMULATION CODE 
AND DEVELOPMENT OF COMMUTING FILTERS 
G. A. Blaisdell 


1 Introduction 

Most flows of engineering interest are turbulent, and turbulence can have a large effect on the performance 
of engineering systems. For instance, the drag on a body is generally higher for turbulent flow, turbulence 
greatly increases heat transfer rates, and turbulent mixing is important to combustion systems. Numerical 
simulation of turbulent flows has become a valuable tool for gaining insight into the complicated flow physics 
of turbulence and it has provided data for evaluation of engineering turbulence models. 

Direct numerical simulation (DNS) of turbulent flows involves solving for the time dependent motion 
of all the relevant length scales within a turbulent flow. Because the range of length scales increases with 
Reynolds number, it becomes prohibitively expensive in terms of computer resources to compute DNS of 
high Reynolds number flows. A method for overcoming this limitation is to directly solve for the large scale 
motions and to model the effect of the small scale motions. This approach is called large eddy simulation 
(LES). 

In LES the small scales are averaged out using a spatial filter. As a result of the averaging procedure 
some information is lost which must be replaced with a subgrid-scale (SGS) model. The SGS model provides 
a closure for the subgrid-scale stresses, much the same as the way an engineering turbulence model provides 
information on the Reynolds stresses. However, it is believed that the large scales vary widely from one flow 
to another while the small scales are more universal. Therefore, in LES one can use a simple SGS model 
and, since the large scales are solved for directly, the results of an LES should be more robust than those of 
an engineering turbulence model. 

In addition to its use as a research tool, large eddy simulation holds the promise of becoming a valuable 
engineering tool in analyzing problems that are too difficult for conventional engineering turbulence models 
to predict accurately. An example of such a problem is the inherently unsteady flow of a turbulent vortex 
interacting with a flexible structure, such as occurs on fighter aircraft at high angles of attack when the 
forebody vortices impact the tails. The unsteady loading on the tail structures causes premature cracking 
and is of serious concern to the Air Force. Another example in which LES is needed is the computation of 
turbulent jets and boundary layers to provide time dependent information for computational aeroacoustics 
calculations. An additional example is the use of LES in computing turbulent mixing in combustion problems, 
such as for fuel injectors in scramjet engines. 

The primary objective of the current research is to help validate a large eddy simulation code developed 
by Drs. Miguel Visbal, Datta Gaitonde, and Donald Rizzetta of the Air Force Research Lab. The LES code 
uses high-order compact finite difference schemes based on the work of Lele[7] and refined by Gaitonde et 
al.[4] and Visbal and Gaitonde[15]. This code is written for compressible flow in generalized coordinates. It 
needs to be validated and tested on a variety of flows. A series of test cases is outlined in the report for the 
1997 AFOSR Summer Research Program by Blaisdell[2]. The test case considered here is that of decaying 
compressible isotropic turbulence. 

The LES code uses currently available LES technology. However, as pointed out in [2], there are several 
unresolved issues regarding the formulation of the LES equations and the SGS models. Resolution of these 
issues is important to the future development of LES. The second objective of this research project addresses 
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the issue of the commutation of filters with derivatives. Recent work is examined for filters that commute 
with an error that is less than the truncation error of the differencing scheme used for the LES calculations. 

The purpose of this report is to summarize the progress made during the course of this project on these 
two objectives. The issues involved are not fully resolved at this point. Therefore, the report concludes with 
suggestions for future work. 

2 LES Code Validation: Compressible Isotropic Turbulence 

The simplest turbulent flow is decaying isotropic turbulence. LES of compressible isotropic turbulence were 
used by Moin et al. [8] to investigate their compressible formulation of the dynamic SGS model. Similar 
simulations were done by Spyropoulos & Blaisdell[12] and Spyropoulos[ll] to more thoroughly investigate 
the ability of the dynamic model to capture compressibility effects and to examine several other issues in 
applying the dynamic SGS model. 

The above LES and previous DNS of compressible homogeneous turbulence all used initial conditions 
which created artificial acoustic waves. Ristorcelli k. Blaisdell[9] have devised a method of producing initial 
conditions which are more consistent and which do not produce strong acoustic waves. Such initial conditions 
should be used in any future studies of compressible homogeneous turbulence. However, since our purpose 
here is to validate the LES code, initial conditions are chosen to match those of previous simulations. 

The governing equations and numerical method used in the code are described in Rizzetta et al.[10]. The 
LES code is called FDL3DI.LES. A detailed description of a version of the code which does not include an 
SGS model is given in Gaitonde and Visbal[5]. As mentioned above the code uses high-order compact finite 
difference schemes developed in [7, 4, 15]. It also uses a compact filter [7, 4, 15] in order to stabilize the 
simulations. The filter is applied directly to the solution at the end of each time step and, therefore, in the 
discussion below the procedure is referred to as solution filtering. It is believed that the filtering has an 
impact on the LES results, as discussed below. Some of the results for isotropic turbulence are included in 
Rizzetta et al., along with test results for channel flow and flow over a cylinder. Here more complete results 
are given for the isotropic test cases. 

2.1 Description of the Simulations 

The simulations considered here are compared to two cases from Spyropoulos & Blaisdell[12], Case 6 and 
Case CBC. Case 6 is a relatively low Reynolds number case with significant compressible fluctuations, while 
Case CBC is meant to reproduce the nearly incompressible experiment of Compte-Bellot and Corrsin[3]. 

The governing equations are nondimensionalized using a reference length, L, chosen so that the domain 
size has a length 27r on each side. The length scale of the turbulence is then defined by specifying the 
initial three-dimensional spectra. For Case 6, the initial spectra for the velocity, density and temperature 
fluctuations are of the form 

£'(«:) oc exp [—2 («:/k:p)^] (1) 

where k is the magnitude of the wavenumber vector, and the wavenumber of the peak of the spectrum Kp 
is set at 4. The proportionality constant in the spectra is adjusted to set the r.m.s. level of the velocity, 
density and temperature. The r.m.s. level of the velocity is specified so that the initial turbulent Mach 
number Mt = 0.4, where is defined as the ratio of the r.m.s. magnitude of the fluctuating velocity to the 
mean speed of sound. The reference velocity, Ur, in the nondimensionalization of the governing equations is 
chosen to be the initial mean speed of sound, so the reference Mach number is Mr = 1. In addition to the 
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turbulent Mach number, the velocity field is parameterized by the fraction of energy in the dilatational part 
of the velocity, x (see [12] for details). In the current simulations the initial value is x = 0.2. The initial 
scaled r.m.s. density is {p'^)/{pf = 0.032, and the initial scaled r.m.s. temperature is {r^)/{Tf = 0.005, 
where the angled brackets, ( ), indicate a volume average over the computational domain. The reference 
Reynolds number, Re, is set to 536.9, which corresponds to a turbulent Reynolds number Ret = 2157, where 

_ q'i/^eu) and {p)q^ = 2k = {pu'fu'l) is twice the turbulent kinetic energy (k or TKE), e is the TKE 
dissipation rate, and v is the kinematic viscosity. 

For the CBC case the initial velocity spectrum was chosen to match the experimental measurements of 
Compte-Bellot and Corrsin[3]. The initial turbulent Mach number was set to Mt = 0.3, the initial density and 
dilatational velocity fluctuations were zero. The pressure field was found by solving the Poisson equation 
for incompressible flow. The computational Reynolds number was Re = 1588, and the initial turbulent 
Reynolds numbers was Ret = 3166. 

Periodic boundary conditions are used for the isotropic simulations. Within the finite difference code, 
FDL3DI, periodic boundary conditions are implemented using a five-point overlap in each direction. In 
discussing grid sizes below, the five-point overlap region is not included in order to facilitate comparison 
with simulations using a spectral code which does not use an overlap region. 

Several LES runs were made of the two cases using the current code, FDL3DI. A description of the current 
runs is presented in Table 1, along with a description of previous runs which are used for comparison. The 
run name is used within this report to distinguish the simulations. The case name refers to one of the 
two cases, Case 6 or CBC from [12]. Under “Code” spectral refers to the Fourier spectral code used by 
Spyropoulos and Blaisdell[12], while FDL3DI refers to the current LES code. The spectral code used the 
dynamic SGS model; however, FDL3DI has the option of using either the dynamic model or the Smagorinsky 
model. The main difference is that with the Smagorinsky model the model coefficients are constant, while 
for the dynamic model they are determined dynamically from the simulation data. 

For the Smagorinsky model FDL3DI also includes the option of using different types of finite difference 
schemes to compute the strain rate used in the eddy viscosity. It is emphasized here that the lower order 
schemes were only used to compute the eddy viscosity and that all other derivatives were computed using 
the compact finite difference schemes presented in Gaitonde and Visbal[5]. All the possible finite difference 
scheme options were exercised for the LES presented in Table 1; however, only minor differences were 
found using the various schemes and, therefore, only the runs using the more accurate compact schemes are 
discussed in this report. 

2.2 Simulation Results: Time Histories 

Simulation 12 is used to validate FDL3DI as a DNS code. It is compared to results from the DNS II, which 
used the previously validated spectral code of [12]. The evolution of the turbulent kinetic energy is shown 
in Figure 1. Here time is normalized as t/ro, where ro = k/e is the initial value of the eddy turnover 
time. Excellent agreement is found. The evolution of the density variance is given in Figure 2. Again, the 
agreement between the two codes is excellent. These results give confidence that FDL3DI is solving the 
governing equations correctly, as a DNS code. 

Results from the 32® LES runs corresponding to Case 6,13,14 and 15, are shown in Figures 3 and 4 for the 
turbulent kinetic energy and density variance respectively. The results are also compared to the filtered DNS 
results from run 12. Here the DNS results have been filtered from a 128® grid onto a 32® grid. The turbulent 
kinetic energy for all of the LES simulations is below that of the filtered DNS. The LES results from the 


4-5 



spectral code are closer to the DNS data than the two finite difference code results. As is shown below by 
looking at spectra, the reduced TKE in the finite difference LES occurs because energy is diminished in the 
higher wavenumber range of the spectrum. It is believed that this is due to the explicitly applied filter, which 
removes energy from the higher wavenumbers in the finite difference calculation, although the effect of the 
filtering cannot be determined for certain without performing further simulations that do not use filtering. 
This point is discussed further throughout the report. The results for the density variance shown in Figure 4 
show trends that are generally similar to those seen for the TKE. 

Figures 5 and 6 show the TKE and density variance for the 48^ LES corresponding to Case 6,16 and 17. 
Note that the filtered DNS data is the same as that shown in Figures 3 and 4, because the DNS data was 
only projected onto a 32^ grid and not onto a 48^ grid. However, because the DNS is very well resolved, 
there is not a significant difference between the filtered DNS data and the unfiltered DNS data. Therefore, 
the available DNS data can still be used to give a meaningful comparison with the 48® LES results. Also, 
note that only 32® results are available from the spectral code. The 48® LES results for TKE lie closer to 
the DNS data than the 32® spectral code results, and they are much closer to the DNS data than the 32® 
finite difference code results. It seems that a finer grid is needed for the finite difference code in order to 
produce results as good ^ the spectral code. Again, the trends for the density variance are similar to those 
for the TKE. 

With the dynamic SGS model the model coefficients C, Cj, and Prt are computed from the simulation 
rather than being specified a priori. A comparison of the coefficients is presented for the Case 6 simulations 
in Figures 7-9. The model coefficient used in the eddy viscosity, C, is shown in Figure 7. The spectral code 
gives values that are similar to the constant value of 0.00846 used in with the Smagorinsky model. The 32® 
finite difference simulation produces values of C that are greater than those of the spectral code, while the 
48® simulation gives values that are closer to those of the spectral code. (Note that due to a post-processing 
error the values of C for the 48® finite difference LES presented in Figure 3 of [10] are too small by a factor 
of (48/32)® = 2.25. The agreement with the results of the spectral code is actually quite good.) 

The evolution of C/, which is used in the modeling of the trace of the normal SGS stresses, is given in 
Figure 8. However, since the finite difference code predicted negative values of Ci, the code set Cj to be 
zero. It is not known why the finite difference code produced negative values of C/ while the spectral code 
produced positive values. For the CBC case discussed below, the finite difference code produces positive 
values. 

The last of the model coefficients is the turbulent Prandtl number, Prt- This is plotted in Figure 9. Both 
of the finite difference LES produce values of Prt that are similar to those from the spectral code, with the 
48® results being closer than the 32® results. All of the dynamic model calculations are well below the value 
of 0.9 used in the Smagorinsky model. 

The CBC case is at higher Reynolds numbers than Case 6. The history of the TKE and the density 
variance is shown in Figures 10 and 11 respectively. The finite difference LES results are compared with the 
spectral LES results. The spectral LES were compared previously with the experimentally measured spectra 
in Spyropoulos and Blaisdell[12], but the experimental data is not included here. DNS data for the CBC 
case has not been published, although a simulation has been done using a 512® grid (Alan Wray, NASA 
Ames, private communication). There is not a significant difference between the two finite difference LES 
simulations, indicating that differences in the SGS model do not have a large impact for these runs. The 
times corresponding to where experimental measurements were taken are t/ro = 0.2870 and 0.6612. At these 
times there is not a large difference in the values of the TKE among the three LES simulations. However, 
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there is a significant difference in the history of the TKE at early times between the spectral LES and the 
finite difference LES. The spectral LES displays a negative curvature in the time history of k. This is typical, 
and is due to the fact that the simulation begins with artificial initial conditions that do not have the proper 
phase relationships among the Fourier modes. It takes some time to establish the nonliner transfer of energy 
to the small scales, and this is reflected in the relatively slow decay at early times. The finite difference 
simulations on the other hand display a very rapid decay of TKE at early times. It is believed that this is 
due to the solution filtering. The initial spectrum (see the spectra results presented in the next section) is 
full, because this is a high Reynolds number case; however, the solution filtering instantly removes energy 
from the highest wavenumber range at the end of the first time step, and more energy is removed with each 
subsequent time step. A test without filtering is needed to be certain of what is happening, but the observed 
behavior seems consistent with this explanation. 

The history of the density variance for the CBC ceise is given in Figure 11. The variance starts at zero since 
the initial conditions are incompressible. As with the TKE there is little difference among the simulations 
at later times, while the finite difference LES have lower fluctuation levels at early times compared to the 
spectral LES results. 

The evolution of the three model coefficients for the CBC runs are shown in Figures 12-14. The values of 
C from both of the dynamic SGS model simulations shown in Figure 12 are larger than the constant value 
of 0.00846 used in the Smagorinsky model. The value for the finite difference LES is somewhat higher than 
that of the spectral LES. 

The values of C/ are shown in Figure 13. In contrast to the Case 6 runs, the finite difference code 
computed a positive value of Cj. The values produced by the finite difference code are close to that assumed 
in the Smagorinsky model, while the spectral code has a much larger value of Cj. As pointed out in [12] the 
values of Ci are sensitive to the test filter, and for some filters a numerical instability can result. Since the 
test filter used in the finite difference code (see [10]) is not the same as the sharp cut-off filter used in the 
spectral calculations, it is not surprising that the values of Ci are different. 

The turbulent Prandtl number, Prt, is shown in Figure 14. As with the Case 6 runs the dynamic SGS 
model gives values of Prt that are lower than the value used in the Smagorinsky model. The finite difference 
code produces somewhat lower values of Prt than the spectral code. 

2.3 Simulation Results: Spectra 

In order to get a better understanding of the behavior of the turbulence in the LES, three-dimensional 
velocity spectra, E{k), were computed. The spectrum represents how the energy in the velocity fluctuations 
is distributed over wavenumbers (length scales). The three-dimensional spectrum gives the energy within 
spherical shells in wave space, and k is the magnitude of the wavenumber vector, k — {kx, Ky,Kz)- Figure 15 
shows the spectra for the DNS run 12 and the two 32^ finite difference LES runs 14 and 15 at time t/ro — 
0.2985. There is very little difference between the two LES results, which indicates that there is little 
difference between the effect of the two SGS models for this case. In order to check whether the SGS is 
having any effect, a coarse grid DNS should also be run (on a 32^ grid with no SGS model), however, this 
was not done. The LES spectra are close to that of the DNS for low wavenumbers; however, at higher 
wavenumbers there is significantly less energy in the LES spectra. This reduction in energy in the smaller 
scales results in the TKE for the LES being smaller than that in the DNS, as shown in Figure 3. It is 
believed that the reason the finite difference results are reduced for the higher wavenumbers is the solution 
filter that is applied in the finite difference code. 
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The spectra for the 48^ LES are shown in Figure 16. The trends are similar to those seen for the 32^ 
simulations; however, the reduction in the energy of the small scales is much less because of the greater 
resolution. This trend is consistent with the TKE seen in Figure 5. 

The spectra for the CBC case are shown in Figures 17 and 18 at times t/ro = 0.2870 and 0.6612 
respectively. As for the Case 6 LES, there is little difference between the results using the two SGS models. 
However, for this high Reynolds number case the difference between the finite difference LES spectra and the 
spectral LES code spectra are very large. The finite difference LES spectra are greatly distorted. The energy 
in the high wavenumbers (small scales) is reduced, while there is too much energy in the low wavenumbers 
(large scales). Even though the values of the TKE shown in Figure 10 are not that different among the 
simulations at the times when the spectra are computed, there is a significant difference in how the energy 
is distributed. This is a good indication that one cannot judge an LES solely by integrated quantities such 
as the TKE. It is important to consider spectra also. 

It is believed that the behavior of the finite difference LES is due to the solution filtering, although 
further tests are needed to be certain of it effects. The filtering removes energy from the high wavenumber 
end of the spectrum. For this high Reynolds number case, where the spectrum is full (see the spectrum 
from the spectral code in Figures 17 and 18), the loss of energy in the higher wavenumber resolved scales 
affects the nonlinear transfer of energy from the lower wavenumber resolved scales. The nonlinear transfer 
of energy across some wavenumber, k^, occurs mostly between wavenumbers that lie close to Ke- If the 
modes with wavenumber greater than Kc are reduced in amplitude, then the nonlinear transfer of energy 
from wavenumbers smaller than «c are inhibited, and energy will tend to pile up in the wavenumbers just 
below Kc. It is believed that this pile-up of energy is responsible for the excessive energy present in the 
low wavenumber region of the finite difference LES spectra. This possibility is explored further in the next 
section. 

2.4 Discussion on the Effects of Filtering 

Much of the discrepancy between the finite difference LES results and the spectral code LES results presented 
in the previous section seems to be due to the solution filtering procedure used in the finite difference code. 
In order to be certain of the effect, it is necessary to do simulations without the filter, or with different 
filter parameters. However, since such simulations have not been done at this time, our discussion must be 
restricted to the expected effect of the filter. 

It should be clarified at this point that the decaying isotropic LES may be able to be done without solution 
filtering, or with filtering only applied infrequently. However, since filtering is needed in more complex flows 
for numerical stability, its effects should be fully understood. Examining the effects of filtering on isotropic 
turbulence may help with that understanding. Also, if the solution to the problem of the effect of filtering 
is to apply it only after a certain number of time steps, then the frequency of applying the filter becomes 
an ad hoc parameter, unless the frequency of application can somehow be related to the properties of the 
solution in a manner similar to how the model coefficients are determined in the dynamic SGS model. 

The filter used for all of the current simulations has the filter parameter, a/, set to aj = 0.49. The filter 
transfer function, G{k), is shown in Figure 19. With this value of the filter parameter, the filter is highly 
optimized. Therefore, the filter transfer function is fairly sharp, and its effect is mostly limited to a narrow 
high wavenumber region. Because the filter is applied at the end of each time step, it can have a cumulative 
effect. The other curves in Figure 19 show the effect of applying the filter multiple times. This is found 
simply by computing powers of the filter transfer function, [G(k:)]^, where N is the number of times the 
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filter is applied. As one would expect, the more the filter is applied the greater its effect on the large scales, 
and the more the effective cutoff wavenumber moves to smaller wavenumbers. 

For a linear problem with constant coefficients, the effect of the application of a solution filter can be 
predicted in the above way, and N is just the number of time steps. However, for nonlinear problems (such as 
turbulence simulations) or problems with nonconstant coefficients, energy is transferred among wavenumbers, 
so that in addition to the simple multiplicative effect of repeated application of the filter, the solution also 
depends on the transfer of energy from other wavenumbers. For decaying isotropic turbulence, applying the 
filter multiple times will greatly diminish the energy in the high wavenumber part of the spectrum, but the 
nonlinear transfer of energy will tend to feed energy back in. So the effective filter transfer function will be 
different than [G(/c)]^, and in fact it may depend on wavenumber. 

In order to make a rough estimate of the effective transfer function for the solution filtering procedure, 
the ratio of the spectrum from the Case 6 finite difference LES runs (using the dynamic SGS model) to the 
spectrum from the DNS run 12 is computed and shown in Figure 20 for the LES runs using a 32^ grid and 
a 48^ grid. It has a general shape similar to the filter transfer function of the solution filter applied many 
times. This is only a qualitative comparison, but it gives an indication that explanation for the depleted 
energy in the smaller resolved scales may be correct. 

In Spyropoulos[ll] the use of solution filtering with a compact finite difference scheme was examined 
(see Figure 3.20 therein). It was found that without filtering a 32^ LES using a compact finite difference 
scheme had a pile-up of energy in the highest wavenumbers. LES were performed on a 48^ grid with solution 
filtering using a sharp cut-off (SCO) filter, where the cut-off wavenumber was chosen to be 16 to make the 
simulation equivalent to a 32® simulation. The results showed excellent agreement with a 32® spectral LES. 
However, an important difference between the current filter used in FDL3DI and the SCO filter used in [11] 
is that the filter transfer function for the SCO filter is the same no matter how many times it is applied. For 
complex flow problems one cannot use the SCO filter, which is defined in Fourier wave space; one can only 
approximate it, as with the filter used in the current LES. 

3 Commutation of Filtering and Differentiation 

A fundamental question regarding LES was raised by Ghosal & Moin[6] who pointed out that for nonuniform 
grids the filtering operation does not commute with differentiation as is generally assumed when the LES 
equations are derived. They were able to devise a filtering method that commutes with differentiation up to 
second order in the filter width. However, this is a difficulty if one wishes to use high-order finite difference 
schemes, such as the compact differencing schemes used here. A filter which commutes to arbitrary order 
was developed by van der Ven[13]; however, it is limited to infinite domains with no boundaries. A survey 
of the issue and an attempt to create discrete filters which commute with discrete derivatives is given by 
Blaisdell[l]. 

In a recent paper Vasilyev[14] developed a set of filters which commute with finite difference schemes 
with an error of a certain order. The filter can be chosen so that the order of the commutation error is higher 
than the order of the truncation error of the difference scheme. In this way the commutation error will be 
negligible relative to the truncation error. The filters of Vasilyev are quite flexible and compact filters are 
available which have filter transfer function similar to those used in the current study. 

As part of the current project the explicit (i.e. noncompact) filters of Vasilyev were tried on some test 
problems. Following [14] the test function is a 16th order Chebyshev polynomial defined on a stretched 
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hyperbolic tangent grid. The coefficients of the filters are given in Table I of the reference, although the 
table is incomplete and it was necessary to derive some of the needed coefficients. The finite difference 
scheme chosen is 4th-order central with 4th-order boundary closures. Figure 21 shows the results for the 
discretization (truncation) error (DE) and the commutation error (CE). The r.m.s. error is plotted, and 
the maximum error behaves similarly. The number after the CE indicates the order to which the filter 
is supposed to commute. The 2nd-order commuting filter gives an error (CE2) that is greater than the 
truncation error of the finite difference scheme. The 4th-order filter produces an error that is comparable 
to the truncation error. The 6th- and 8th-order commuting filters both give commutation errors that are 
significantly less than the truncation error. 

Figure 21 can be compared to Figure 10 of the reference. Note that in Figure 21, for a large number of 
grid points, the commutation error actually scales with the order of the difference scheme rather than the 
advertised order. However, the level of the commutation error is significantly less than the truncation error, so 
that the difference in the scaling is inconsequential. In discussing the discrepancy with Dr. Vasilyev (private 
communication, 1998), it was found that the difference may be that he did not consider the commutation 
error coming from regions near the boundary in the test cases he ran, whereas the commutation error is 
greatest near boundaries due to the boundary treatments necessary in both the difference scheme and the 
filter. 

In principle, the commuting filters of Vasilyev can be used so that commutation errors are not a concern. 
However, within the LES community this is not being done. The issue of commutation errors is basically 
being ignored. (Some researchers state that the commutation error is wrapped into the SGS model, but this 
amounts to ignoring the issue.) It may be that commutation errors are not significant. A few researchers 
that I have contacted informally have computed the commutation error from their LES data and found that 
it is small relative to the other terms in the LES equations. This is not quite the same as computing the 
commutation error based on fully resolved data, but it does give an indication of whether commutation errors 
should be of concern. In general, it is recommended that LES include computation of the commutation error 
as a check. 

One point that needs to be kept in mind is that the manner in which filtering is used to derive the LES 
equations is not the same as the the solution filtering procedure applied in the current finite difference code. 
The difference between the two remains unclear at this point. 

4 Conclusions and Suggestions for Future Work 

The LES code, FDL3DI.LES, developed at the Air Force Research Lab has been applied to the problem 
of decaying compressible isotropic turbulence. The code uses a high-order compact finite difference scheme 
coupled with a high-order compact spatial filter. Comparisons were made with previous results from a 
Fourier spectral code. 

The finite difference code was first tested on a DNS problem and excellent agreement was found with the 
results of the spectral code. LES were run for a low Reynolds number case, corresponding to the DNS test 
ceise, and a high Reynolds number case, corresponding to the experiment of Compte-Bellot and Corrsin. For 
the low Reynolds number case, the finite difference simulations on a 48^ grid seem to agree well with the 
32^ LES from the spectral code. The turbulent kinetic energy and the density variance histories agree well. 
The values of the SGS model coefficients C and Pr^ are also similar between the two codes. The coefficient 
Cl differs between the two codes; however, it is noted that this coefficient is sensitive to the type of test 
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filter used to compute it, which may account for the difference. The 32^ LES of this case showed too rapid 
a decay of the turbulent kinetic energy. Examination of spectra show that this behavior is due to a loss 
of energy from the smaller resolved scales. It is believed that this is due to the effect of the solution filter 
applied in the finite difference code, although it may also be that the 32^ grid is too small to use with the 
finite difference method for this problem. 

For the high Reynolds number case only 32^ LES were performed. The history of the turbulent kinetic 
energy from the finite difference LES shows very rapid decay decay at early times, but shows good agreement 
with the spectral code LES results at later times. However, examination of the spectra reveals that the finite 
difference LES spectra are highly distorted, with too much energy in the large scales and too little energy in 
the small scales. Again, it is believed that this behavior is due to the solution filtering procedure, although 
further tests are needed to be certain. It is also likely that the finite difference code will require a finer grid 
in order to perform an adequate LES of this high Reynolds number case. 

As part of the project, the issue of the commutation of filtering and differentiation was considered. The 
filters recently proposed by Vasilyev were considered. It was verified on a test problem that the higher order 
commuting filters produce a commutation error that is less than the truncation error of the finite difference 
scheme used. Although, these filters hold promise, they are not widely used. Also, there is an unsettled issue 
as to how the solution filtering procedure used here is related to the filtering assumed in the derivation of 
the LES equations, which is where the commutation issue arises. 

There are several tasks that should be done to complete and extend the current work. First, the simulation 
cases considered did not include coarse grid DNS runs. These should be done in order to determine whether 
the SGS model has a significant effect or not. It also would give an indication of the effect of the numerical 
method. The simulations should be rerun without solution filtering, so that it can be determined whether 
the observed discrepancies between the finite difference LES and the spectral code LES are due to excessive 
filtering of the solution. Also, the Compte-Bellot and Corrsin case should be redone using 48® and a 64® 
grids. 

An extension of the current research that should be useful is to investigate the effect of solution filtering 
in more detail. Different filter parameters or different types of filters can be used. The filter can be applied 
at various intervals instead of after each time step. The filter interval then becomes an additional parameter; 
however, it may be possible to find a rational way to determine the filtering frequency. Lastly, the relationship 
between solution filtering and the filtering assumed in deriving the LES equations should be considered. 


References 

[1] Blaisdell, G. A., “Commutation of Discrete Filters and Differential Operators for Large-Eddy Simu¬ 
lation,” Advances in DNS/LES, Proceedings of the First AFOSR International Conference on Direct 
Numerical Simulation and Large Eddy Simulation, Louisiana Tech Univ., Ruston, LA, USA, August 
4-8, 1997, pp. 333-340, Greyden Press, Columbus. 

[2] Blaisdell, G. A., “A Review of Benchmark Flows for Large Eddy Simulation,” Final Report for the 
AFOSR Summer Faculty Research Program, Wright Laboratory, August 1997. 

[3] Compte-Bellot, G. and Corrsin, S., “Simple Eulerian Time Correlation of Full and Narrow Band Velocity 
Signals in Grid Generated, Isotropic Turbulence,” J. Fluid Mech., Vol. 48, pp. 273-337, 1971. 


4-11 





[4] Gaitonde, D. V., Shang, J. S., and Young, J. L., “Practical Aspects of High-Order Accurate Finite- 
Volume Schemes for Electromagnetics,” AIAA Paper 97-0363, Jan. 1997. 

[5] Gaitonde, D. V. and Visbal, M. R., “High-Order Schemes for Navier-Stokes Equations: Algorithm and 
Implementation into FDL3DI,” AFRL-VA-WP-TR-1998-3060, Air Force Research Laboratory, Wright- 
Patterson AFB, OH, Aug. 1998. 

[6] Ghosal, S. and Moin, P., “The basic equations for the large eddy simulation of turbulent flow in complex 
geometry,” J. Comp. Phys., Vol. 118, pp. 24-37, 1995. 

[7] Lele, S. K., “Compact finite difference schemes with spectral-like resolution,” J. Comp. Phys., Vol. 103, 
pp. 16-42, 1992. 

[8] Moin, P., Squires, K., Cabot, W., and Lee, S., “A Dynamic Subgrid-Scale Model for Compressible 
Turbulence and Scalar Transport,” Phys. Fluids A, Vol. 3, pp. 2746-2757, November 1991. 

[9] Ristorcelli, J. R. and Blaisdell, G. A., “Consistent Initial Conditions for the DNS of Compressible 
Turbulence,” Phys. Fluids, Vol. 9, pp. 4-6, January 1997. 

[10] Rizzetta, D. P., Visbal, M. R., and Blaisdell, G. A., “Application of a High-Order Compact Difference 
Scheme to Large-Eddy and Direct Numerical Simulation,” AIAA Paper 99-3714, June 1999. 

[11] Spyropoulos, E. T., “On Dynamic Subgrid-Scale Modeling for Large-Eddy Simulation of Compressible 
Turbulent Flows,” Ph.D. Thesis, School of Aeronautics and Astronautics, Purdue Univ., West Lafayette, 
Indiana, December 1996. 

[12] Spyropoulos, E. T. and Blaisdell, G. A., “Evaluation of the Dynamic Model for Simulations of Com¬ 
pressible Decaying Isotropic Turbulence,” AIAA J., Vol. 34, pp. 990-998, May 1996. 

[13] van der Ven, H., “A family of large eddy simulation (LES) filters with non-uniform filter widths,” Phys. 
Fluids, Vol. 7, pp. 1171-1172, 1995. 

[14] Vasilyev, Oleg V., Lund, Thomas S., Moin, Parviz, “A General Class of Commutative Filters for LES 
in Complex Geometries,” J. Comp. Phys., Vol. 146, No. 1, pp. 82-104, 1998. 

[15] Visbal, M. R. and Gaitonde, D. V., “High Order Accurate Methods for Unsteady Vortical Flows on 
Curvilinear Meshes,” AIAA Paper 98-0131, Jan. 1998. 


4-12 




Table 1: Case descriptions for decaying isotropic turbulence simulations. 


Run 

Case 

Code 

DNS/LES 

SGS Model 


11 

Case 6 

spectral 

DNS 

none 


mm 

Case 6 

FDL3DI 

DNS 

none 

IFBSl 

13 

Case 6 

spectral 

LES 

dynamic 

mm 

14 

Case 6 

FDL3DI 

LES 

dynamic 

wm 

15 

Case 6 

FDL3DI 

LES 


wsm 

16 

Case 6 

FDL3DI 

LES 

dynamic 

mm 

17 

Case 6 

FDL3DI 

LES 

Smagorinsky 


18 

CBC 

spectral 

LES 

dynamic 

wm 

19 

CBC 

FDL3DI 

LES 

dynamic 

MM 

no 

CBC 

FDL3DI 

LES 

Smagorinsky 
























































0.08 



Figure 1: Evolution of the turbulent kinetic energy for the DNS runs, II and 12. 



Figure 2: Evolution of the density variance for the DNS runs, II and 12. 
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Figure 3; Evolution of the turbulent kinetic energy for the 32^ LES runs, 13, 14 and 15 compared to the 
filtered DNS data from run 12. 



Figure 4: Evolution of the density variance for the 32^ LES runs, 13, 14 and 15 compared to the filtered DNS 
data from run 12. 
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Figure 5; Evolution of the turbulent kinetic energy for the 48^ LES runs, 16 and 17, and the 32^ LES run 13 
compared to the filtered DNS data from run 12. 
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Figure 6: Evolution of the density variance for the 48^ LES runs, 16 and 17, and the 32^ LES run 13 compared 
to the filtered DNS data from run 12. 













Figure 7; Evolution of the model coefficient C for the dynamic Case 6 LES runs, 13,14 and 16, compared to 
the constant value used in the Smagorinsky model. 



Figure 8: Evolution of the model coefficient Cj for the dynamic Case 6 LES run, 13, compared to the constant 
value used in the Smagorinsky model. (The values for runs 14 and 16 are zero.) 
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Figure 11: Evolution of the density variance for the CBC LES runs, 18, 19 and 110. 
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Figure 12: Evolution of the model coefficient C for the dynamic CBC LES runs, 18, 19 and 110, compared 
to the constant value used in the Smagorinsky model. 
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Figure 13: Evolution of the model coefficient Cj for the dynamic CBC LES runs, 18, 19 and 110, compared 
to the constant value used in the Smagorinsky model. 
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Figure 14; Evolution of the turbulent Prandtl number Pr* for the dynamic CBC LES runs, 18, 19 and 110, 
compared to the constant value used in the Smagorinsky model. 












Figure 15: Three-dimensional velocity spectrum for the 32^ Case 6 LES, 14 and 15, compared with the DNS 
12 at time t/ro = 0.2985. 



Figure 16; Three-dimensional velocity spectrum for the 48^ Case 6 LES, 16 and 17, compared with the DNS 
12 at time </ro = 0.2985. 
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Figure 17: Three-dimensional velocity spectrum for the CBC LES, 19 and 110, compared with the spectral 
code LES 18 at time t/ro = 0.2870. 



Figure 18: Three-dimensional velocity spectrum for the CBC LES, 19 and 110, compared with the spectral 
code LES 18 at time t/ro = 0.6612. 
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Figure 19: Filter transfer function for the solution filter used in FDL3DI (with a; - 0.49) applied multiple 
times. 



Figure 20; Ratio of the three-dimensional velocity spectrum from the Case 6 LES, 14 and 16, to that from 
the DNS 12. 
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Abstract 


Many animals use panoramic visual motion cues to monitor their progress through the 
world and correct deviations from their intended courses. A computational model inspired by 
features of insect visual systems was developed to examine the roles of both physiological tuning 
properties and network architecture in generating drcuits that are selective for optic flow field 
type and position. Arrays of small-field motion detectors (SFMDs) endowed with various 
fimctional properties (selectivity to motion direction and speed; purely excitatory outputs or 
excitatoryfinhibitory outputs) were tested with alternative spatial synaptic output patterns to 
evaluate what features of SFMD outputs are most important for generating wide-field optic flow 
selectivity. The biomimetic flow-selective properties exhibited by this model include “coarse 
coding” for optic flow type, maintenance of selectivity for optic flow t5^e across a range of 
optic flow speeds, and sensitivity to optic flow position relative to receptive field center of a 
flow-selective collator neuron or summating element. The key design features are a simple feed¬ 
forward network composed of two basic stages: (a) arrays of sfinds that respond according to 
temporal delays determined by motion direction and speed, and (b) simple spatial patterns of 
purely excitatory synaptic coimection strengths that can be designed to generate selectivity for a 
wide variety of flow fields. Flow field selectivity can be produced by optimizing either of two 
basic design features: the response properties of the sfinds and the spatial synaptic output 
mappings to the collator stage. Design parameters are flexible: if roughly optimal values for 
either of the two basic features are met, robust model behavior is obtained without the need for 
precise control of the other parameters. The results of this investigation suggest specific design 
strategies for man-made optic flow analyzers. These strategies are applicable to flow analysis in 
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general, and therefore can be extended to other types of spatial processing tasks, including 
tracking and pursuit of small mobile targets. 

Introduction 

This report describes a computational model for optic flow processing that was inspired 
by known features of motion processing circuitry in the visual systems of flies (dipteran insects). 
The model was developed as a result of the author’s participation in the AFOSR summer feculty 
research program at Eglin AFB, FL, where several basic information processing strategies 
employed by insect visual systems were identified for possible applications to machine vision 
hardware (Douglass, 1997). In particular, the optic flow processing model described below was 
designed to elucidate a basic issue in neuronal information processing; to what extent are 
neuronal circuits based on broad parametric tuning (coarse-coding) mechanisms as opposed to 
sharp tuning, or “matched filtering”? Previous theoretical work based on insect circuits that 
process directional wind inputs has suggested that directional information transfer is maximized 
when the direction-sensitive circuit elements exhibit a single optimal tuning curve bandwidth 
(Theunissen and Miller, 1991). The model visual processing circuit presented here challenges 
this view, by demonstrating that the components of the circuit need not be restricted to a single 
directional tuning curve width in order to generate selectivity for optic flow type and position. 

This model is designed to examine the properties of alternative configurations of 
idealized optic flow processing circuitry. Although it is based on known features of insect visual 
pathways, many relevant details of this system are unknown or only inferred at present. Thus, 
the goal is not to faithfully imitate the actual circuitry, but to generate testable predictions and 
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elucidate important general requirements for optic flow processing. The results of this 
investigation suggest specific design strategies for man-made optic flow analyzers. These 
strategies are applicable to flow analysis in general, and therefore can be extended to other types 
of spatial processing tasks, including tracking and pursuit of small mobile targets. 

Biological Bases for the model. Conceptual models of biological optic flow processing 
mechanisms generally fall into two categories, based on the types of input features that are used 
to generate flow field selectivity. These two alternative categories are known as the “direction 
mosaic hypothesis” and the “vector field hypothesis” (Duffy and Wurtz, 1991). The direction 
mosaic hypothesis postulates an array or mosaic of small-receptive field neurons or circuits, each 
small-field unit being sensitive to local motion direction and having its outputs arranged to yield 
large-field selectivity for a particular type of optic flow. This type of mechanism should 
inherently be sensitive to the portions of circular or radial flow fields with respect to the 
receptive field of the circuit. In contrast, the vector field hypothesis relies on responsivenes to 
fundamental features of particular types of optic flow, features that are distributed throughout 
the flow field and often are less position-dependent. These features include the divergence 
among local vector directions in radial fields, the curvature, or “curl” of circular fields, and the 
gradient in local vector magnitudes that is present within unidirectional, or “planar” flow fields 
(c.f Koenderink, 1986). 

Although there is some evidence for the operation of both the direction mosaic and 
vector field strategies in primate cortical circuitry (Duffy and Wurtz, 1991), the evidence from 
insects favors the direction mosaic hypothesis (Krapp and Hengstenberg, 1997). In both insects 
and vertebrates, various Unes of evidence suggest that the fundamental building blocks for optic 
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flow analysis comprise two basic stages. The first stage consists of retinotopically mapped 
arrays of small-field-of-regard, direction-sensitive motion detectors, which obtain their inputs 
fi-om an photoreceptors and peripheral visual intemeurons, and which provide their outputs, in 
the second stage, to wide-field-of-regard collator neurons (c.f Douglass and Strausfeld 1995; 
Morrone et al., 1995). The model described here also contains two basic stages, and is based on 
the direction mosaic h 5 T)Othesis. 

Several additional features of the model are based on known features of insect visual 
processing circuitry. In flies, a retinotopically mapped array of small-field neurons known as T5 
cells corresponds to the small-field motion detectors in the 1st stage of the model. Intracellular 
recordings fi-om identified T5 neurons have demonstrated their directional selectivity to motion 
(Douglass and Strausfeld, 1995), and a number of investigations (reviewed by Strausfeld, 1989; 
Strausfeld and Lee, 1991) show that they provide major inputs to wide-field collator neurons in 
a visual processing center known as the lobula plate. Many of the wide-field collators are 
sensitive to directional motion, and at least a subset is selective for complex optic flow fields 
(Krapp and Hengstenberg, 1996, 1997). There is strong anatomical evidence for the existence 
of four superimposed arrays of T5 cells within each retinotopic column, or visual sampling point 
(Strausfeld and Lee, 1991), thus, the model below specifies four arrays of small-field motion 
detectors (sfinds). Each array of sfinds produces maximal responses during stimulation by 
motion in a single “preferred direction” (PD), and the four PDs are oriented at 90-degree 
intervals. In flies, individual T5 neurons tend to have their outputs segregated into one of four 
distinct anatomical layers in the lobula plate (Fischbach and Dittrich, 1989; Strausfeld, 1989) 
that are individually sensitive to progressive, regressive, upward and downward motion 



(Buchner et al., 1984). Both the physiological responses and anatomical stratification of various 
identified neurons with lobula plate inputs are consistent with a four-layered functional 
organization of the lobula plate (e.g. Eckert 1982; Douglass and Strausfeld 1996,1998). 

Methods 

The model defines a feed-forward network with optic flow fields as its inputs, and 
responses of wide-field collator neurons as its outputs. The source code was written in Turbo 
Pascal (Borland International, Inc ). Based on the user’s selection of model parameters, the 
model first defines a particular optic flow field and flow field position. It then computes 
responses of arrays of local motion detectors (sfinds); these responses serve as the inputs to 
model wide-field collator neurons that are designed to exhibit selectivity for a particular type of 
non-uniform optic flow (expansion, contraction, clockwise (CLW) rotation, counterclockwise 
(CCLW) rotation) based on a spatial pattern of synaptic output connection strengths fi^om the 
sfinds. 

Stimuli. The optic flow stimulus (Fig. 1, at left) is defined as a spatial array of local 
motion vectors. Each vector is defined by an (x,y) location within a plane orthogonal to the line 
of sight of a viewer, an eye, or a visual sensor, and by a local motion amplitude r and direction 
0. Flow fields can be specified as one of five cardinal flow field types (pure clockwise rotation, 
counerclockwise rotation, expansion, contraction, or unidirectional flow), or as combinations of 
these cardinal types. In addition, the location of a flow field can be varied in relation to the 
receptive field center and receptive field diameter of the collator neuron. In its current 
implementation, local flow vectors are represented as their projections from 3-dimensional space 
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onto a 2-(iimensional plane. This method is reasonably accurate for modelling moderately wide 
visual fields that approach hemispherical views, in future implementations, the stimulus 
definitions will be generalized to encompass the nearly panoramic fields of view that insects 
possess and which are desirable to emulate in hardware. 

For each sfind within the receptive field of the collator neuron, the model defines a 
stimulus vector direction (0) and magnitude A according to (1) the type of optic flow field, and 
(2) the distance r from the flow field center to the sfmd sampling point in visual space (This is an 
approximation to the small region of visual space which is actually sampled by the small-field 
inputs to an sfmd circuit; see below). The local stimulus direction 0, in degrees, is defined as the 
difference between the local flow direction and the preferred direction (PD) of the sfmd: 
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0 = arcsin(y/r) - (PD+90) + c. 


xi>0 


(1) 


0 = (PD+90) + c - arcsin(y/r), xi <= 0. 

Xi and y are the horizontal and vertical components, respectively, of r, and the value of c 
determines the type of optic flow field (c = 0 for pure clockwise rotation, c = 90 for pure 
expansion (looming), and so on). 

As illustrated at the left of Fig. 1, the local flow field vector magnitude A is proportional 
to its distance R from the optic flow field center and a flow field speed factor k. (K is 
translatable to rpm for rotational fields; deg./s for expanding or contracting fields): 

A = kR (2) 


Small-field motion detectors. 

The small-field motion detectors (sfinds) depicted in Figure 1 were modeled in two 
ways. First, because appropriately detailed information on the response properties of actual 
sfinds were unavailable, a detailed circuit model of insect sfinds was used to generate responses 
to motion direction and speed that can be considered biologically plausible. Second, a purely 
analytical model was developed that captured the major features of the detailed model, yet could 
be conveniently altered in predictable ways. The analj^ical model was used as the fiirst stage in 
the wide-field optic flow model of Figure 1. 

Detailed SFMD Model hosed on Hassenstein-Reichardt Correlation -type Detectors. 

The foundation for the more-detailed sfind model is the classical Hassenstein-Reichardt 
correlation-type elementary motion detector (emd) circuit (Hassenstein and Reichardt, 1956; 
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Borst and Egelhaaf, 1989), which employs two light-sensitive input channels with slightly 
different viewing angles, a fixed internal temporal delay or “circuit delay” in one channel, and a 
mathematical multiplication of the outputs from the two channels. In the present implementation 
(Fig. 2), a single edge motion stimulus is modeled as the sequential arrival of an increase or 
decrease in light intensity at the two channels, the external delay in the stimulus arrival at the 
two channels being determined by the motion speed and the angular distance between the 
channels. The responses of each charmel to changes in light intensity are defined using 
exponential functions of time (J.K. Douglass, unpublished) that are designed to mimic typical 
intracellular voltage responses to flicker observed in intracellular recordings from various 
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peripheral, non-motion-selective visual intemeurons of calliphorid flies (e.g. Laughlin, 1981; 
Douglass and Strausfeld, 1995). Beyond the level of the photoreceptors, these responses 
typically consist of a hyperpolarization to ‘light ON” and a depolarization to “light OFF,” and 
both responses are characterized by a rapid onset and relatively slow decay toward the baseline 
voltage. 

The inputs to neighboring emd channels were defined to be 2® apart, and the fixed circuit 
delay was set at 12 ms, with an additional cable conduction delay of 8 ms within each channel. 
The emd response was defined as a non-spUcing intracellular voltage, integrated over an interval 
of several ms after the arrival of a single edge at either channel. In order to produce both 
positive (excitatory) and negative (inhibitory) responses, the outputs from two mirror- 
symmetrical emds having opposite “preferred motion direction axes” were subtracted (Borst and 
Egelhaaf, 1990), and this result was taken to represent the response of a single sjmd. The 
preferred/null axis of this type of emd is the line defined by the two input channels. Motion of an 
edge that is orthogonal to the preferred/null axis produces a direction-dependent response 
(Grzywacz et al., 199). An alternative circuit configuration was also tested to illustrate the 
motion responses of a more spatially-distributed circuit (see Fig. 2B and Results), in which the 
outputs from two pairs of emds, their preferred/null axes separated by 60° (Buchner, 1976), 
were summed to produce the response of an sfind. 

Analytical SFMD Model. 

The first stage of the optic flow model consists of sfind responses to local motion vectors. The 
responses of each sfind were defined as a fUnction of motion direction (0) and speed (S). 
Responses to motion speed were modelled as an exponential function. 
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r(S) = k*S* e . P) 

The shape of this function (see Results, Fig. 5E) provides a good qualitative fit to speed 
response data that have been obtained during intracellular recordings fi-om wide-field collator 
neurons in the lobula plate of flies (Eckert 1980, 1982; Hausen 1982, Hengstenberg 1982), as 
well as single-unit recordings fi-om directional motion - sensitive neurons in area MT of the 
visual cortex of macaque monkeys (Maunsell and Van Essen, 1983). This approximate shape 
also is predicted fi-om sfind models during motion along the preferred/nuU axis (Results, Fig. 3). 

To investigate the effects of sfind directional tuning properties on the model output, the 
speed response function was convolved with one of four alternative cosine-shaped directional 
tuning functions f(0) (Results, Fig. 5 A-D), chosen to encompass a broad range of possible 
characteristics of biolo^cal sfinds. The alternative directional functions are either broadly 
(fA(e), fB(0) ), or narrowly (fc(0), fD(0) ) tuned, and produce either monophasic (purely 
excitatory; fA(0), fc(0) ) or biphasic (both excitatory and inhibitory, fB(0), fD(0) ) responses as a 
function of motion direction. Identified sfind output neurons in the fly brain (T5 cells, Douglass 
and Strausfeld, 1995) exhibit broad, combined excitatory/inhibitory responses to motion. 
Monophasic (e.g. purely excitatory) responses are also possible, as occurs in spiking neurons 
with little or no background activity, and as is also predicted for sfind circuits that lack a 
subtraction stage, or in which the integration stage involves a biophysical mechanism known as 
shunting inhibition. The general equation used to specify the directional tuning of the sfinds is 
R(0) = b + cos [ a*(0 - (PD)], |0 - PD| < Ti/a, (4) 

R(0) = b, Ti/a <= |0 - PD| < Tt/a, 
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where R is the response as a function of motion direction, “b” specifies a constant offset from 

the zero response level, “a” determines the sharpness (or coarseness) of the directional tuning, 

and PD = the preferred direction of the sfind, defined as the motion direction that produces the 

maximal response ampUtude. The differences among the four alternative sfind tuning functions 

in Figure 5 reflect the values that were chosen for parameters a and b; 

function _a_b 

fA 1 0.5 

ffi 1 0 

fc 2 0.5 

fn 2 0 

In summary, the response of each sfind was computed as a fianction of the speed and 
direction of the local motion vector at its location within a retinotopic map. The overall 
response of each sfind is thus: 

R(e,S) = R(e)*R(S) (5) 

Wide-field collator responses 

Responses of a wide-field collator neuron were calculated by multiplying each sfind 
output with a spatial synaptic weight function, then computing the linear sum of all synaptic 
outputs. Simple, “uniform” output mappings to a collator neuron (Fig. 3, upper portion) were 
based on those proposed by Tanaka et al. (1989), and were designed to result in model collator 
neurons that are selective for one of four cardinal flow field types: clockwise rotation, 
counterclockwise rotation, expansion (positive looming) and contraction (negative looming). 
Also tested were cosine-shaped “gradient” output mappings (Fig. 3, lower portion) similar to 
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Uniform Maps 


Fig. 3 




those that have been observed in the leech body wall mechanosensory system that controls a 
bending reflex (see Lewis and Kristan, 1998), In most tests, collator responses were recorded 
across a range of flow field positions relative to the receptive field of the collator neuron. For 
both uniform and gradient mappings, tests were also performed using a range of mapping 
“bandwidths”. In the following equations for synaptic mappings, bw--= the angular bandwidth of 
a uniform mapping, and “hbw” = the half bandwidth of a gradient mapping function. 

Uniform synaptic weight function: 

w=l, 10|<=bw/2 (6) 

w = 0, 10| > bw/2 


Gradient synaptic weight function: 

w = cos(a*0), |01 <= hbw 
w = 0, |01 > hbw 


( 7 ) 



Results 


Properties of Small-field motion detectors. 

This section first describes the response profiles of the Reichardt-Hassenstein 
correlation-type sfind models, then compares them with profiles obtained fi-om the purely 
analytical sfind model. 

In both two-channel and three-channel versions of the correlation-type circuit (Fig. 4), 
maximal responses are exhibited at motion directions near zero degrees and motion speeds near 
approximately 120 deg/s. (The speed response maximum is determined by the value chosen for 
the internal delay). Note, however, that there is no single “preferred direction” that holds for all 
motion speeds. This is particularly evident in the two-channel model (Fig. 4A,B), in which the 
strongest responses form two distinct peaks, and responses at slow speeds show strong 
fluctuations as a fimction of motion direction. This complexity arises because the emd responses 
are determined by the temporal delay in the arrival times of a contrasting edge at different input 
channels, and this delay is affected not only by changes in motion direction, but also by changes 
in motion speed. At increasingly slow speeds, strong responses are still possible as long as the 
combination of motion direction and speed results in favorable delays. This phenomenon is 
related to that discussed by Zanker (1990) with reference to responses of wide-field motion- 
sensitive neurons to moving gratings. 

The actual circuitry for sfinds of insects rem^s unknown, but almost certainly is more 
complex than the 2-channel correlator represented in Fig. 2A. Because neuronal morphologies 
and synaptic connections tend to be more difiuse, the more spatially distributed sfind circuit of 
Fig. 2B may be more biologically realistic. This circuit also has the interesting property of 


5-14 




Fig. 4 



exhibiting reduced complexity in the response profile to motion direction and speed. As 
illustrated in Figs. 4C and D, the more distributed sfmd circuit produces only one major 
response maximum and one major minimum in the response profile. The shape of the profile is 
also more symmetrical and the contours less steep than in the 2-channel model. In summary. 
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computationally undesirable features of the simple two-chaimel model are reduced by assuimng 
a more spatially distributed, and probably more biologically realistic version. 

Figure 5 (F-I) shows the response profiles of the purely analytical model sfinds as a 
function of motion direction and speed, illustrated for the four alternative directional tuning 
Sanctions fA, fe, fc, fo- The overall response profiles defined by the analytical model are 
qualitatively similar to those of the correlation-type sfind models discussed above. As in the 3- 
channel version of the correlation model, each profile is characterized by a single response 
maximum that defines both a preferred direction and a preferred speed. As in versions of the 
correlation model that employ a subtraction stage, the analytical models that employ combined 
excitatoryfinhibitory directional tuning (fe and fo) produce a response minimum at the preferred 
speed (when motion direction is 180 degrees fi'om the preferred direction). The sfinds based on 
purely excitatory directional tuning (fA and fc) have a clearly defined response maximum, but no 
well-defined minimum. 

n. Model Outputs: Optic Flow Selectivity in Wide-Field Collator neurons 

Non-specialized (one-to-one) Synaptic Mcps 

The following two figures compare responses of the wide-field model to various optic 
flow field types and positions, also as a function of the four alternative sfind directional tuning 
functions described in the Methods. First, figure 6 illustrates the properties of a simple circuit in 
which the spatial pattern of synaptic connection strengths fi’om the sfinds to the collator neuron 
is completely uniform (one-to-one). With this unspecialized synaptic mapping pattern. 
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selectivity for optic flow position is obtained with three of the four directional tuning functions 
(fA, fc and fo), but there is no selectivity for optic flow type. Clearly, whereas position 
selectivity is a simple consequence of pooling the outputs from an array of small-field detectors, 
optic flow type selectivity requires some level of nonuniformity in the pattern of synaptic 
weights. As subsequent results demonstrate, however, it is possible to generate remarkably 
robust selectivity for optic flow type using any of a broad range of mappings. 

Circuits for Flaw Field Selectivity, I: Alternative Directional Tuning Functions 

In order to test the properties of alternative directional tuning fimctions, a uniform 
synaptic mapping pattern was chosen based on a conceptual model of optic flow processing in 
primate cortex (Tanaka et al., 1989). A synaptic mapping pattern designed for clockwise 
rotation detection is illustrated in Figure 3 (upper patterns). Connections from sfinds whose 
“preferred direction” is rightward motion (0®) are specified only from an upper quadrant of the 
sfind array, connections from upward - sensitive (90®) sfinds are made from the left quadrant of 
the array, and so on. Corresponding mapping patterns for counterclockwise rotation, expansion 
and contraction were specified simply by shifting the polar coordinates of the patterns for 
clockwise rotation by +180®, -90®, and +90®, respectively. Similarly, patterns for non-cardinal 
optic flow types (such as clockwise rotation or counterclockwise expansion) were specified by 
using intermediate shifts. 

Collator responses obtained using the 90® -bandwidth, uniform maps for pure clockwise 
rotation are shown in Figure 7, which has the same overall format as Figure 6. In contrast with 
the outputs from one-to-one mapping, all sfind tuning functions now result in position selectivity 
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to two or more of the cardinal flow types. More importantly, some degree of flow type 
selectivity is now obtained with all four directional tuning functions. Among the four tuning 
functions, clearly fa (Fig. 7B) provides the most effective overall selectivity for clockwise 
rotation. Although other functions (Fig. 7A, C, D) also produce strong re^nses to centered 
clockwise rotation, sensitivity to unidirectional flow and complex responses to other flow types 
both compromise the desired selectivity for clockwise rotation, particularly in flgure 7 A and C). 
Only with fn ate strong, position-sensitive responses to clockwise rotation combmed with (1) 
symmetrical, inhibitory responses to the opposite flow type (counterclockwise rotation) and (2) 
complete insensitivity to contraction, expansion and unidirectional flow. 

These results demonstrate that exceUent selectivity for clockwise rotation can be 
obtained using a simple uniform synaptic mapping pattern designed to maximize responses to 
this cardinal flow type. Other tests have employed analogous uniform mappings designed to 
produce selectivity for flow types other than clockwise rotation (counterclockwise rotauon, 
expansion, contraction, and intermediates). The results of these tests (not shown) are identical 
to those in Figure 7, but with response patterns switched according to the flow type for which 
the mapping pattern was designed. For example, by using a unifoim, 90“ -bandwidth mappmg 
designed to detect pure expansion, fa produces excitatory responses to expansion, inhibitory 
responses to contraction, and no response to unidirectional flow or to clockwise or 
counterclockwise rotation. In summary, exceUent selectivity for any arbitrary flow type is 
possible by combining an appropriate mapping pattern with the broad, excitatory/inhibitory 

small-field directional tuning function fs- 
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Robustness of optic flow selectivity at different motion speeds 

In order to test further the robustness of selectivity for optic flow type in this model, 
responses using directional function fe were obtained over a wide range of optic flow speeds, 
spanning the sfind speed response function fi'om zero to speeds far beyond the preferred local 
motion speed (as defined by the constant k in Equation 2). In response to stimulation with the 
“preferred” optic flow type defined by the synaptic maping pattern, position selectivity remains 
robust over a broad range of speeds, and is degraded only at the very slowest speeds (Fig. 8A). 
Moreover, sensitivity to non-preferred flow types (Fig. 8B) remains low across the entire range 
of motion speeds tested. 

Coarse coding for optic flow type and position 

A widespread feature of many neuronal information processing systems is that individual 
elements tend to be broadly tuned (coarsely coded) to each of several parameters. This 
important biological feature is reproduced quite well in this model. A broad range of optic flow 
fields, although they represent complex features of visual stimuli, can be represented on a linear, 
circularly symmetrical “flow type” scale that is analogous to simpler parameters such as angle 
and phase. In the example illustrated in Figure 9, the parametric values for pure expansion and 
contraction differ by 180®, and rotational fields are orthogonal to looming fields. Conceivably, 
selectivity for flow field type could be either narrowly or broadly tuned. In this model, using 
sfind function fe, the tuning is broad indeed: the pattern of responses to optic flow type has a 
half-bandwidth of 180® and precisely fits a cosine function. Selectivity for flow field position is 
also coarsely-coded, as is demonstrated in Figure 9B for an expansion detector circuit. 
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Circuits for Flew Field Selectivity, II: Alternative Sfind Output Mcq)pings 

The results described so far suggest that the circuit requirements for robust, coarsely- 
coded selectivity for optic flow type and position are remarkably simple: (1) a small-field 
processing level, composed of arrays of sfinds having (a) a suitable motion speed response 
fimetion, (b) broad directional tuning, and (c) directional tuning that includes both excitatory and 
inhibitory responses (e.g. directional fimetion fe). (2) a wide-field “collation” stage, created by 
linearly s umming the outputs of a mapped subset of the sfinds, the map for each sfind array 
ha ving been designed to promote maximal responses to a particular optic flow pattern when it is 
centered on the collator. 

Despite their efiectiveness, these specifications for an optic flow analyzer could suffer 
fi'om at least two major shortcomings. First, it is possible that deviating fi’om the 90° bandwidth, 
unif orm maps employed so far could compromise the robust optic flow selectivity that has been 
observed. Second, the model appears to exhibit sensitive dependence upon suitable directional 
tuning of the small-field elements, suggesting that function fs -like characteristics must be 
preserved at all times [in order to maintain effective optic flow processing]. If, however, certain 
details of directional tuning and output mappings are not crucial to the success of the circuit, far 
greater design flexibility would be possible. This kind of flexibility would be of considerable 
interest, both for understanding the evolutionary and developmental processes that have 
produced biological optic flow processing circuits, and also for evaluating the feasibility of 
manufacturing man-made processors based on similar designs. This section therefore 
investigates the extent to which alternative synaptic mappings may affect the model’s 
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performance, and how to reduce the model’s sensitivity to directional tuning characteristics 
through improvements to the synaptic mapping architecture. 

In contrast with the uniform maps employed to this point, “gradient maps” are 
characterized by spatial gradients in synaptic connection strength (Fig. 3). Both types of 
mapping pattern are biolo^cally plausible, and cosine-shaped gradient maps may be crucial to 
the performance of distributed-coding circuits that share features with this optic flow model (c.f 
Lewis and Kristan, 1998). Thus, cosine-shaped gradient maps were tested as an alternative to 
uniform maps. Parts A-C of Figure 10 show the results of combining directional function fe 
with cosine-shaped synaptic connectivity maps having half-bandwidths of 90, 180 and 270®, 
respectively. As with the uniform maps presented above, the total range of synaptic weights is 
from 0 to 1. There is no qualitative change in either the position selectivity or optic flow type 
selectivity, and only minor changes in the maximum response amplitudes. Figure 10 D examines 
the relationship between maximum response amplitudes and map broadness (uniform map 
bandwidth or gradient map half-bandwidth) in more detail, and confirms that regardless of the 
type of mapping, strong response ampUtudes are possible across a wide range of bandwidths. 

The results in Figure 10 show that as long as the mapping bandwidth is not extremely 
narrow or broad, model performance is not highly sensitive to the precise bandwidth value. 
Having estabUshed that considerable design flexibility in mapping bandwidth is permissible, can 
adjustments of this parameter be used to reduce the model’s sensitivity to sfind directional 
t iming characteristics? As with the directional tuning functions, bandwidth is only one of two 
major characteristics of the mapping functions. The second, crucial feature is the offset from 
zero, which determines whether synaptic weights are dominated by excitatory or mhibitory 
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values. Preliminary tests indicate that an interplay between the mapping function offset and 
bandwidth determines the sensitivity to sfind directional tuning. With cosine-shaped gradient 
mappings, a very effective (and possibly optimal) solution employs a half-bandwidth of 180® and 
a zero offset. As illustrated in Figure 11, this mapping function nearly eliminates the model’s 
dependence on sfind tuning characteristics. Now, fianctions fs, fc, and fo all produce excellent 
selectivity for flow field type and position; only fx remains ineffective. 

Discussion 

This computational model combines biologically realistic responses of small-field motion 
detectors to motion direction and speed, with hypothesized spatial and physiological features of 
synaptic connections fi'om the small-field detectors to a wide-field, linear summation stage. 
These simple design features were inspired by known structural and functional characteristics of 
motion-processing circuits in insect brains. Extensive tests of the model’s responses to various 
flow field stimuli demonstrate robust selectivity for flow type and position. Moreover, these 
tests demonstrate considerable design flexibility in the specification of (a) directional tuning 
functions of the small-field motion detectors, and (b) spatial patterns of output connections to 
the collator stage. The model makes various predictions regarding the functional organization 
of optic flow processing circuitry in insects, many of which can be tested using existing 
technology for neurophysiological and neuroanatomical investigations. In addition, the model 
suggests which design features will be crucial for future biomimetic machine vision hardware. 
By analogy, the same features should also be useful for other flow field applications, such as 
hydrodynamic flow analysis using mechanosensors instead of photosensors. Because of the 
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robust behavior and flexible design features of this model, it can provide a strong basis for future 
designs of inexpensive, yet sophisticated guidance control systems for munitions and other 
automated vehicles. The main conclusions from this investigation are discussed below. 

This model makes various predictions, several of which are testable with physiological 
recordings and anatomical studies of insect brains. 

1. The well-known 4-layered directional organization of the calhphorid fly lobula plate 
(Strausfeld, 1989) is not required for optic flow processing. This leads to a hypothesis: not all 
dipterans need to have the 4-layered lobula plate structure, and in those that do, this must have 
some other, as yet unidentified functional significance. Current research is also testing the 
assumption that the sfind outputs (T5 cells) are indeed segregated to 4 distinct strata in the 
lobula plate. 

2. Directional tuning properties of sfinds, and their synaptic mappings to optic flow- 
selective vdde-field neurons are interrelated, such that any physiological data regarding one type 
of property will suggest predictions about the other. For example, if directional tuning of sfinds 
varies according to stimulus conditions, the model predicts that optic flow analyzers should have 
sophisticated synaptic mapping patterns (e.g. involving 2 or more neurotransmitters). 
Conversely, if spatial synaptic maps involve purely excitatory transmitters, directional tuning 
functions are expected to be similar to function fB (Fig. 5). In this case, the model predicts that 
additional mechanisms are required to stabilize sfind directional tuning functions under varying 
input conditions (e.g. different spatiotemporal frequencies). Additional recordings from sfinds 
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are needed in order to test their response profiles as a function of both motion direction and 
speed. 

3. The model predicts “coarse coding” for flow field type, as has been observed in 
primates (Graziano et al. 1994). This prediction can be tested in real optic flow neurons in 
insects. The model also predicts selectivity for flow field position, with details dependent on the 
speed tuning of the sfinds relative to the receptive field size of the collator neurons. 

4. The model makes detailed predictions of the spatial patterns of excitatory and 
inhibitory inputs to dendrites (or postsynaptic spatial patterns of dendritic activity) of collator 
neurons (JK Douglass, in preparation). In principle, these predictions are testable with 
intracellular recordings using voltage-sensitive or calcium-sensitive dyes. 

What are the key design features for optic flow processing in this model? Optic flow- 
selective neurons with realistic biological properties can be made using a simple feed-forward 
network composed of two basic stages; (a) a few arrays (here, four) of correlation-type sfinds 
that respond according to temporal delays determined by motion direction and speed, and (b) 
simple spatial patterns of purely excitatory synaptic connection strengths that can be designed to 
generate selectivity for a wide variety of flow fields. Thus, a single set of sfind arrays can be 
used to generate model collator neurons that are selective for rotation, expansion, contraction, 
unidirectional flow, or intermediate flow types. This result provides a detailed demonstration of 
a basic principle of information processing that has been documented in insect visual systems: 
shared, evolutionarily conserved peripheral mechanisms (Douglass and Strausfeld 1995; 
Buschbeck and Strausfeld, 1996) provide the basis for various types of higher-level processing 
(Buschbeck and Strausfeld, 1997). 
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In summary, the biomimetic flow-selective properties exhibited by this model include 
“coarse coding” for optic flow type, maintenance of selectivity for optic flow type across a range 
of optic flow speeds, and sensitivity to optic flow position relative to receptive field center of a 
flow-selective collator neuron or summating element. These selectivities can be produced by 
optimizing either of two basic design features: the response properties of the sfinds and the 
spatial S 5 maptic output mappings to the collator stage. If roughly optimal values for either of 
these two features are met, robust model behavior can be had without the need for tight control 
of the other parameters. The obvious implication for circuit design is that by identifying which 
design parameters can be optimized more precisely and at lower production cost, tremendous 
savings can be realized because of the increased flexibihty in achieving those parameters that 
would be more expensive to specify according to stricter tolerances. 
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Prediction of Compression textures in Tantalum 
by a Pencil-Glide Computer Program. 


Background 

For bcc metals deforming by <111>-pencil glide, lattice rotations can 
be uniquely predicted. The purpose of this project is to predict the 
texture development in tantalum during compression. After heavy 
compression, of bcc metals, the grains become oriented so that either 
their <111> directions or <100> directions are aligned with the axis of 
compression (1). 

An existing computer program (2) based on a true <111 >-pencil glide 
model has been used to relate the plastic properties of bcc metals to their 
texture. The present work is to modify this program so that it can predict 
orientation changes during axisymetric compression. This program 
recognizes that with <111>- pencil glide, there are eight degrees of 
freedom: these degrees of freedom being the amount of slip in each of the 
four <111 > slip directions and the orientation of the slip planes 
associated with each of the four directions. For any general shape change, 
five degrees of freedom of slip are necessary, so a general shape change 
can be accomplished either by slip in three <111> directions or by slip on 
all four <111 > directions, with the orientations of the slip planes 
providing the extra degrees of freedom. 

The calculation procedure explores both possibilities. For the three 
active slip direction possibility, one slip direction is assumed to be 
inactive. Then an orientation of one of the slip planes is assumed. The 
orientations of the other two are then calculated, as well as the amount 
of slip in all three directions, the total amount of slip, ZdlTjl/de, being 

noted. Then a new orientation of the slip plane is assumed and the 
calculation is repeated. Such calculations are continued until the 
minimum value of Zdlyjl/de is found. This procedure is then repeated, each 
time with a different inactive slip direction. Finally, the solution with the 
lowest value of Zdl7|l/de is noted. 

The other possibility is that all four slip directions are active. 

Piehler and Backofen (3) found that there are only four stress states that 
can activate all four slip directions simultaneously. The virtual work to 
achieve the shape change is calulated for each. The best of the four-slip 
direction solutions is the one for which the virtual work is a maximum. 

The optimum four slip system solution is then examined to see if the 
stress state required to produce simultaneous slip on the four systems is 
admissible. If it is, this is the appropriate solution. Otherwise the three 
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slip system solution is appropriate. 
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The necessary work is to identify the slip systems operative for the 
appropriate solution and the amount of slip on these and use this 
information to calculate the lattice rotations. The mathematics required 
for the analysis has been worked out and is outlined below. 

Orientation description 

The system used in this project of describing the orientation of an 
axis, z, relative to the cubic crystal axes Involves two angles, 0 and (j) as 
shown ion figure 1. The angle (j) can be thought of as the latitude and 0 as 
the longitude if 3 = [001] is the north pole and [100] lies on the equator at 
the Greenwich longitude. 3 ,qq, 


Figure 1. Stereographic representation 
of the orientation of the z-axis, in terms 
of 6 and (|). 



2*010 


The objective of the program is to determine the orientation changes 
of grains of various orientations undergoing deformation that is axially 
symetric about z. Therefore the input shape changes will be in small 
increments of compressive strain, (e.g. = -0.1). For axisymetry then 

Aex = Afiy = - (1/2)Ae2 


These strains can be transformed into strains along the cube axes by the 


following equations: 

Ae-j = Ae2[cos^(t)cos^0 - .5(sin2(t>cos20 + sin20)] (10) 

Ae2 = Ae2[cos2(|)sin20 - .5(sin2(|)sln20 + cos^O)] (11) 

A£3 = Ae2[sin2(t>- .Scos^cf) ] (12) 

Ay 23 = Ae2[2sin(j)COS(|) sin0 + cos4)sin<|)sin0] (13) 

Ay 3 i = Ae2[2sin<|)C0S(j) cos0 + cos4)sin<|)cos0] (14) 

Ayi 2 = Ae23sin0cos03cos24) (15) 
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In the <111>- pencil glide model, it is assumed that the slip 
directions are the close-packed <111> directions and the slip planes are 
those planes containing for which the shear stress is the highest. Figure 2 
shows how the orientations of the normals to these planes can be 
described by an angle, \|f, of rotation about the slip directions. The 
direction cosines between these slip elements and the 1 = [100], 2 = [010] 
and 3 = [001] crystal axes are listed in Table 1. 


Figure 2. Stereographic representation 
of the pencil glide systems, a, b, c, and d 
are the slip plane normals for the A = [ITI], 
B =[111], C = [1 IT] and D = [ITT] slip 

directions. Vc’ 

rotations of the slip plane normal about these 

axes. 



Table I Direction cosines of the <111 > slip directions and slip-plane 
normals with the cubic axes 1 = [100], 2 = [010], and 3 = [001] 


Slip directions slip plane normals 

A = [in] 


il d = ^(2/3)cos\i/a ^-1 n 

iJ. 2 d = V(2/3)cos(60+\tfa) =-''^3 

ASd = -V(2/3)cos(60-\|fa) 5-1 n 

B = [111] 

Aid = V(2/3)cos\tfb Aip=1/V3 

A 2 d = -V(2/3)cos(60-vb) n =''/^3 

Asd = V{2/3)cos(120-\|/b) Ain =1/V3 

C = [11T1 

Aid = V(2/3)cosvc Ain=1/V3 


A 2 d = ^/(2/3)cos{120-\|rc) ^2n = ■'/^3 

Asd = -V(2/3)cos(60-vc) ^3n = -■' ^^3 


D = [1TT1 

Aid = 'i(2/3)cosvd iln = ‘'/^3 

A 2 d =-^(2/3)cos{60-vd) A 2 n = -1/>/3 

A3d = V(2/3)cos(60+vd) i^3n = -■'/^3 
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Rotations 

Slip on each of the slip systems results in a rotation, coj, of the 
crystal orientation about the three crystal axes, that depends on the 
orientation of the slip plane, Vj, and the amount of slip, Yj, on that system. 

The net rotations about each of the crystal axes are the sums of the 
rotations, ®j, cause be each slip system. The equations relating the 

rotations about the cubic axes to the individual values of \|/j, and y; are 

listed below. For rotation about the 1 = [100] axis, 

(01 = Z(Y/2)(i2D^3N- ^ 2 N^ 3 d) summed over all 4 slip systems or 

(0i = (V3/6)[Ya(i2Da‘*'-®-3Da)'’‘'Yb(^2Db"^3Db)''’'Yc(‘^2Dc‘^3Dc)'^ Td(‘^2Dd'''■^3Dd)l 

(16) 

For rotation about the 2 = [010] axis, 

©2 = £(y2)(i3DiiN- S. 3 n!^id) summed over all 4 slip systems or 

(02 = (V3/6)[7a(l30a-<^1Da)+'»b(^3Db-<^1Db)+tc(<^3Dc+*^1Dc) +'lfd(*^3Dd+<^1Dd)l 

For rotation about the 3 = [001] axis, 

(03 = 5:(Y/2)(iliD^2N- ^in^ 2 d) summed over all 4 slip systems or 

(O 3 = (>/3/6)[Ya(- 5 .1 oa’ ^2Da)‘‘'Tto( ^ 1 Db' ^2Db)+Yc( Dc' ^2Dc)‘'’T^d(’ ^ 1 Dd* ^3Dd)^ 


Fypressinq rotations about cube a xes as ch qnqes pf ^ and . Q , 

The change A({) resulting from a rotation (Op can be deduced from the 


spherical triangles represented in figure 3 as follows: 
3 ^ 



Figure 3. Change in (j) and 0 from a rotation 


sinq = cos(90-(t)) = sinpcosvi/p so 
\|/ = sinq/sinp^. 

Here cosp^ = cosScoscj) is a constant during rotation, ©•]. 

Since sin[(tH-(A(l))i] = sinpiCos(\jri+©i), cos [(j>+(A<!))i] = cosp-|COs[e+(A0)i], so 
January 26, 1999 


Dr. William Hosford 


the change in q can be found from 

COS[0+(A6)-|] = COS[<t)+(A()))-|]/COSpi (20) 

The change resulting from a rotation ( 02 , can be found from the 



Figure 4. Change in ({) and 0 from a rotation 0)2 

sin[0+(A<t))2] = sinp2COs(\|r2 +©2), (21) 

where cosp 2 = sin 7 tsin 0 and cos\|r 2 = sin<|)sinp 2 so 

sin [0 +(A 0 ) 2 ] = sin(t)/sinp 2 . ( 22 ) 

The change Atj) resulting from a rotation © 3 , can be found from the 
spherical triangles represented in figure 5. 


3 



Figure 5. Changes in (]) and 0 from a rotation (O 2 
P 3 = 90 - ()) SO 

©3 = -(A0)3 (22) 

( 24 ) 


(A(t))3 = 0 

January 26, 1999 
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The overall orientation change is then given by 
A<|) = (A<t))i + (A(!))2 
and 

A0 = (A0)i + (Ae)2 + (A0)3 


" elappliin'g sucessive increments of Ae^, the changes of orientation 

with comoressive strain wiii be monitored. The resuits wiii aiiow us to 
predict which orientations wiii rotate toward an [1001-end orientation 
and which toward a [111 l-end orientation. 

The resuits wiii aiiow prediction of the strain levei necessary to 
deveLpe a J^ven strength of texture. The vaiidity of this approach w I be 
checked by comparing these predictions with experimental observatio . 

this project is far behind schedule. Money for the preset 
was not received by the University of Michigan until May 1998. Because 
this it was not possible to hire a student to work on the project unti 
January 1999 when Daniel Greszczak was hired. 

The existing Fortran program for pencil glide described above has 
been acquired. Modififications to this program for axisymetnc 
deform^ion about a single crystallographic 
has involved becoming familiar with the program and changing of 

notation for axes. 

The necessary mathematical equations (1 to 26) for calculating 
lattice rotations from slip have been worked out. 

1 C S Barrett Trans. rA/A/fE, 105 (1939) p. 296. 

2 R W Logan "Upper-Bound Anisotropic Yield Loci Calculations 
Lsumin^ <lT>-Pendl Glide," Int J. Mech. Sci. 22 (1980) 419-59. 

« H R Piehler and W. A. Backofen, "A Theoretical Examination of the 
Plastic Properties of BCC Crystals Deforming by <111 > Pencil Glide. Me . 
Trans. 2 (1971) 249-55. 
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Abstract 

An efficient and scalable parallelization of a sequential Fortran time-dependent Maxwell 
equations solver using High Performance Fortran (HPF) has been implemented. This report 
describes the background of the project, the theory behind the efficiency being achieved, the 
parallelization methodologies employed, and the experimental results obtained on the Cray 
T3E massively parallel computing system. 

The program is parallelized through HPF directives. The major directives used in the 
code are the ALIGN, DISTRIBUTE, and INDEPENDENT directives. Through these direc¬ 
tives, we were able to distribute the arrays according to our parallelization strategy and to 
instruct the parallel compiler how to parallelize the code efficiently. 

Experimental runs show that the execution time is reduced drastically through parallel 
computing. Due to the requirement of the HPF compiler I used, many huge arrays have to 
be initialized before their usage, a lot of overhead is introduced. However, the code is still 
scalable up to 98 processors on the Cray T3E. In fact, based on our experience we predict 
that the execution time can be further reduced by using 196 processors. Unfortunately, the 
T3E system we used has only 128 processors and we cannot confirm this. 

Based on the experimentation carried out in this research, we believe that a high level 
parallel programming language such as the High Performance Fortran is a fast, viable and 
economical approach to parallelize many existing sequential codes which exhibit a lot of 
parallelism. 
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1 Introduction 

Computational electromagnetics (CEM) in the time domain is the most general numerical ap¬ 
proach for describing dynamic or wide-band frequency electromagnetic phenomena. Computa¬ 
tional simulations are derived from discretized approximations to the time-dependent Maxwell 
equations [1-10]. High numerical efficiency of CEM simulation procedures can be attained either 
by algorithmic improvements to solve the Maxwell equations or by using scalable parallel dis¬ 
tributed memory computer systems. Since the massive volume of data processing are involved 
in solving the Maxwell equations, distributed memory computer systems are viable means to 
solve the memory shortage problem on workstations or vector computer systems. The other 
advantage is reduced time when parallel processing is employed to solve the Maxwell equations. 
Hence, parallelization of existing sequential Fortran code for solving Maxwell equations is an 
important effort towards developing efficient and accurate CEM code in analyzing refraction 
and diffraction phenomena for aircraft signature technology. 

Previously, three versions of of the solver were available. One is the sequential Fortran version 
developed by Dr Joseph J.S. Shang for vector machines such as the CRAY C90. The second 
is the MPI (Message Passing Interface) [15-16] implemented by Dr. Marcus Wagner of IBM. 
The third one is the Power Fortran version [14] developed by me during the summer of 1997 
when I was an AFOSR Faculty Research Fellow at WPAFB. The description of the programs 
and their performance results are summarized in [11]. Using the message passing paradigm has 
several disadvantages: the cost of producing a message passing code may be between 5 and 10 
times that of its serial counterpart, the length of the code grows significantly, and it is much 
less readable and less maintainable than the sequential version. For these reasons, it is widely 
agreed that a higher level programming paradigm is essential if parallel systems are to be widely 
adopted. The results obtained during my summer research show that the performance of the 
parallel PFA code is even better than the MPI counterpart besides the advantages stated above. 
This suggests that a higher level programming paradigm such as HPF is a viable and alternative 
way to parallelize CEM codes. 

However, the results achieved during my summer research is only a preliminary step in 
achieving a scalable version of the MAX3D code. The major problem with the PFA version is 
that it is not portable across different parallel computer systems since Power Fortran is only 
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available on SGI Power Challenger and Origin 2000. 

The primary research objectives of this research are to develop parallel code for the MAX3D 
using High Performance Fortran (HPF). HPF is more powerful than Power Fortran and is 
available in almost every major parallel computer systems. Our goals are to further improve the 
scalability of the Power Fortran version of the MAX3D code, investigate the possibility of using 
HPF to implement parallel GEM applications, make the parallel code portable across several 
parallel platforms, and present the results to other researchers via conference and/or journal 
pubhcation. These objectives have been accomplished through implementation of the parallel 
code for the MAX3D via PGHPF (a HPF compiler developed by Portland Group). Experimental 
runs show that the execution time is reduced drastically through the usage of HPF. The time 
spent for the implementation is about two months. Compared with MPI implementation, it 
is much shorter. Since HPF is a general parallel language and not targeted to any particular 
machine, its performance is usually not as good as specialized parallel language such as Power 
Fortran on the SGI machines. Yet, the HPF code we produced is stiU scalable up to 98 processors 
on the Cray T3E. Based on the experimentation carried out in this research, we show that the 
High Performance Fortran is a viable and alternative way to parallelize many existing sequential 
codes which exhibit a lot of parallelism besides MPI. In many cases, using HPF to implement 
a sequential Fortran code is more economical than using MPI. HPF also solves the problem of 
limited memory size on a single processor and provide a fast way to port the code on parallel 
computer systems. 

In the following sections, I will present the work performed, the methods used, and the 
results achieved during the research, and outline the impact of the proposed research. 

2 Methodologies 

High Performance Fortran (HPF) is an informal standard for extensions to Fortran to assist 
its implementation on parallel architectures, particularly for data-paraUel computation [18]. 
Among other things, it includes directives for expressing data distribution across multiple mem¬ 
ories, extra facilities for expressing data parallel and concurrent execution, and a mechanism for 
interfacing HPF to other languages and programming models. HPF is available on almost any 
popular paraUel computer systems, such as T3E, SP2, SGI Origin 2000, and is much easy to 
use and fast to port than MPI [15-16]. HPF allows you to add HPF directives in the sequential 
Fortran code and to apply the capabilities of a multiprocessor system to the execution of a single 
job. It splits the job into concurrently executing pieces, thereby decreasing the wall-clock run 
time of the job. Although HPF uses a lot of comphcated analysis tools to analyze the user 
program and can produce a corresponding parallel version without user intervention, the quality 
of the parallel code produced in such a way is usually very poor. In my study, I found that using 
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the PGHPF compiler’s option -Mautopar to parallelize the MAX3D code automatically, the 
parallel results are not effective. The compUer is simply not intelligent enough to make a smart 
decision. Another problem with automatic parallelization is that it can create a large number 
of temporary arrays. Hence, automatic parallelization has to be combined with hand tuning to 
produce more efficient code. In my research, only hand parallelization is used to avoid memory 
waste and increase efficiency. 

The HPF model can be seen as a collection of distinguishable, but complementary, program¬ 
ming styles. The models and methods I used in the implementation are data sharing and work 
sharing. In data sharing, data, such as an array, are spreaded over the memory of aU of the 
available processors so that each processor operates primarily on its own part. In work sharing, 
the loop iterations are distributed within loops among the system’s processors with the goal of 
executing them in parallel. For instance, you can use the INDEPENDENT directive to divide 
the iterations of a DO loop among processors, or you can let the compiler divide the work for 
you by choosing implicit array syntax. One natural and powerful strategy involves distributing 
an array, data sharing, and the iterations of a DO loop that operate on that array, work sharing, 
over your available processors. Processors executing a DO loop in parallel help you realize the 
power of the CRAY T3E and other parallel computing system. In the following sections, the 
details of data allocation and parallelization schemes are discussed. 

2.1 Distributed Data Allocation 

In a parallel program, data can be shared or private. Arrays can also be distributed to speedup 
the code. Arrays are distributed across all processors via the !HPF$ DISTRIBUTE directive in 
HPF. The DISTRIBUTE directive names the variables that are to be shared data objects and 
specifies how they are to be distributed across the PEs. If the data is not specified as shared, 
it is private (the default). In a program that declares a private data item, each processor gets 
a copy of storage for that item. In many programming models, this data is called replicated. 
When an array is shared , its elements are distributed across the available PEs. 

The DISTRIBUTE directive can specify array distribution within the directive by following 
the array name with distribution information contained in parentheses. The distribution infor¬ 
mation may include the keywords BLOCK or CYCLIC, which teU the compiler how to distribute 
array elements among the available PEs. 

The DISTRIBUTE directive allows data to be distributed over processors in a variety of 
patterns. The ALIGN directive is used to s[ecify that certain data objects are to be mapped in 
the same way as certain other data objects. Operations between aligned data objects are likely 
to be more efficient than operations between data objects that are not known to be aligned. The 
BLOCK distribution distribute a block of consecutive memory locations to a processor. In our 
implementation, UO, Ul, U2, F, and G are all aligned and have the same distribution, as shown 
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below. Hence, we can parallelize the code across dimension K efficiently when working on these 
large arrays. The ONTO clause specifies the processor arrangement declared in a PROCESSORS 
directive. In our case, the arrangement is PROCS . 

!HPF$ PROCESSORS PROCS(NUMBER_0F_PR0CESS0RS()) 

!HPF$ ALIGN Ul(I.J.K.L) WITH U0(I,J,K.L) 

!HPF$ ALIGN U2(I.J,K.L) WITH U0(I,J,K.L) 

!HPF$ ALIGN F(I.J,K,L) WITH U0(I,J,K,L) 

!HPF$ ALIGN G(I,J.K,L) WITH U0(I.J,K,L) 

!HPF$ DISTRIBUTE UOC*,*,BLOCK,*) ONTO PROCS 


Similarly, since we need to parallelize HZETA on dimension J, the H array is distributed on 
dimension J as follows: 


!HPF$ DISTRIBUTE H(*,BLOCK,*,*) ONTO PROCS 

Since different phases require different data distributions, it would be nice to dynamically 
distribute an array during execution. HPF provides such mechanism. They are DYNAMIC and 
REDISTRIBUTE directives. In our research, we also tried to use these two directives to distribute 
arrays dynamically. Since redistribution of large arrays such as UO and H take a lot of time, 
the overhead incurred is lager than the saving in time. Hence, we did not use this redistribution 
scheme in our final implementation. Another distribution method is CYCLIC. This distribution 
distributes an array of elements cyclically to different processors. We found this method was 
not efficient for the MAX3D code. 

2.2 Parallelization 

The major part of the parallelization process for the MAX3D code is loop parallelization since 
the codes contain several big loops. The model of parallelism used focuses on the Fortran DO 
loop. The compiler executes different iterations of the DO loop in parallel on multiple processors. 

The essential compiler directive for multiprocessing in HPF is !HPF$ INDEPENDENT. This 
directive directs the compiler to generate special code to run iterations of a DO loop in parallel. 
The loop is also called shared loop. In a shared loop, the iterations are divided among the 
available tasks. A shared loop can be specified by the INDEPENDENT directive. A private 
loop, by contrast, is executed only by the task that invokes it; no work is shared among tasks. 
Private loops are not preceded by an INDEPENDENT directive. For example, the following 


7-6 



directive specifies that loop K in subroutine FXI should be parallelized and I, IM, etc, are 
private variables. 


!HPF$ INDEPENDENT, NEW(I, 
!HPF$&RZ, 

!HPF$&UJX. UJY, UJZ, SUJ, 
!HPF$&UP1. UP2. UP3, UP4, 
!HPF$&UC2, UC3, UC5, UC6, 
!HPF$&FM6. FM) 

DO 1 K=1,KLM 
DO 1 J=1,JLM 
DO 2 1=1,ILM 


IM, IP, J, SSXI, RSSXI, UIl, UI2, UI3, RX, 

RSUJ, UJl, UJ2, UJ3, UKl, UK2, UK3, 

UPS, UP6, 

FP2, FP3, FP5, FP6, FP, FM2, FM3, FM5, 


RY,& 

& 

& 

& 

& 


Shared loops specify the behavior of all tasks collectively, but they define the behavior of 
individual tasks only implicitly. Shared loops do not guarantee the order in which iterations will 
be executed. The lack of a defined order lets the system execute iterations concurrently. 

Inside a shared loop, each task executes its assigned loop iterations as if each task were its 
own serial region. The assignment of loop iterations is accomplished by aligning the iterations 
according to the distribution of the array named in the ON clause of the INDEPENDENT 
directive. 

There is an implicit barrier synchronization at the end of a shared loop, at which aU tasks 
wait until the last one has completed. The shared loop ends after the DO loop that immediately 
follows the INDEPENDENT directive. After the shared loop has finished executing, the tasks 
continue to execute in parallel. 

For multiprocessing to work correctly, the iterations of the loop must not depend on each 
other; each iteration must stand alone and produce the same answer regardless of when any 
other iteration of the loop is executed. Not all DO loops have this property, and loops without 
it cannot be correctly executed in parallel. However, many of the loops encountered in practice 
fit this model. Further, many loops that cannot be run in parallel in their original form can be 
rewritten to run wholly or partially. 

Many loops that have data dependencies can be rewritten so that some or all of the loop 
can be run in parallel. The essential idea is to locate the statement(s) in the loop that can¬ 
not be made parallel and try to find another way to express it that does not depend on any 
other iteration of the loop. Through loop merge and loop switch, we eliminate a lot of data 
dependencies in the code. Some data dependencies can not be ehminated due to the nature of 
the computations involved in the code. Our strategy is to parallelize the loops which do not 
have any data dependencies. For example, loop K in subroutine HZETA has data dependen- 
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cies. However, loop J does not. Hence, we parallelized loop J instead of loop K in subroutine 
HZETA. Similarly, we parallelized loop K in subroutines FXI and GETA. This is also reflected 
in their data distribution: array E, G, and H are distributed on different dimensions as shown 
previously. 

3 Performance Results 

During this research, I used the CRAY T3E computer system at the Ohio Supercomputer Center, 
which is a powerful scalable parallel system with 128 processing elements. Its Peak performance 
can reach 76.8 GFLOPS. Each processor is a DECchip 64-bit super-scalar RISC processor. It 
has four-way instruction issue with two floating-point operations per clock. Each processor has 
on-chip 8 Kbyte direct-mapped LI instruction cache and on-chip 8 Kbyte direct-mapped LI data 
cache. It also has on-chip 96 Kbyte three-way-set-associative L2 unified cache. Each processor 
has a local memory of 16 Mwords (or 128 Mbytes). The clock speed is of the processor is 300 
MHz and the peak performance of the processor is 600 MFLOPS. Although the speed is quite 
fast, compared with other parallel systems such as the SGI Origin 2000, the cache and local 
memory sizes of the T3E are much smaller (Each processor in the SGI Origin 2000 contains a 4 
Mbytes secondary cache). This limits the power of the T3E for programs using a lot of memory 
space such as the MAX3D code. Actually, the effect of smaller cache size on the T3E is not 
only on parallel programs, but also on sequential programs. Although the processor speed in 
the T3E is higher than those in the SGI Origin 2000 or the IBM SP2, the system performce of 
the T3E is no better than the SGI Origin 2000 as shown below in our research. 

Several experiments are carried out to tune our parallel code and to adjust our strategy as 
to how to distribute the various arrays and which loops and subroutines to parallehze. In this 
section, some experimental results are reported for the final parallel code produced. AU the 
times are in seconds in the following presentation. 

Table 1 lists the the timing information for the major subroutines in the code. The corre¬ 
sponding times are also presented in Figure 1. Since these subroutines are repeated many times, 
the time spent there contributes heavily to the total time of the code. As shown in Figure 1, 
aU the subroutines are scalable up to 98 processors. When using 128 processors, no further 
reduction in time is achieved. It is also clear that the subroutine SUM spends the most time 
compared to other subroutines. Due to memory size, the program cannot be executed on the 
T3E with fewer than 4 processors. 

Table 2 shows the execution times the code after loop transformations and hand paralleliza¬ 
tion. Because of the hmited memory space available on each processor in the T3E system, the 
parallel program could be run on the T3E with one or two processors. Hence, Table 2 only 
shows the execution times using at least four processors. Due to the requirement of the HPF 
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No. of Processors 

FXI 

GETA 

HZETA 

SUM 

4 

0.59236 

0.89535 

0.8809 

2.8894 

8 

0.31400 

0.45646 

0.44389 

1.4939 

16 

0.16963 

0.25290 

0.22604 

0.83692 

32 

0.099733 

0.15042 

0.11862 

0.50716 

48 

0.074417 

0.10890 

0.11501 

0.37680 

64 



0.064894 

0.25680 

90 



0.064764 

0.25511 

98 

0.02948 

0.04119 

0.06406 

0.1443 

128 

0.02948 

0.04108 

0.06384 

0.1430 


Table 1: Execution Times in the Major Subroutines. 


Execution Times in FXI, GETA, HZETA and SUM 



Figure 1: Execution Times of the Major Subroutines. 
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No. of Processors 

Execution Time 

4 

16331 

8 

8713 

16 

4853 

32 

2919 

64 

1659 

71 

1680 

90 

1688 

98 

1127 

128 

1148 


Table 2: Total Parallel Execution Times for Problem Size (73 x 61 x 97). 

compiler I used, many huge arrays have to be initialized before their usage, a lot of overhead 
is introduced. Hence, the execution time on the T3E with four processors is larger than the 
execution time on the Origin 2000 with four processors. However, the HPF code on the T3E is 
more scalable than the PEA code on the Origin 2000. From Figure 2, we can also see that the 
execution time reaches its minimum when 98 processors are used. Clearly, the parallel code not 
only reduces the total execution time drastically, but also is scalable up to 98 processors on the 
T3E system. 

We can also see that the time is not reduced when we increase the number of processors 
from 64 to 71 or 90. This is natural since we parallelize the code on dimension K which has a 
size of 97. 71 or 90 processors are not enough to partition the array on dimension K. Due to 
array padding, the actual size on dimension K is 98 and hence has a perfect partition over 98 
processors. Increasing the processor number is not effective unless we can use 196 processors. We 
predict that the execution time can be further reduced by using 196 processors. Unfortunately, 
the T3E system has only 128 processors. 

Because the timing information is not available on the T3E when using only one processor, 
we cannot calculate the speedup exactly. In order to estimate the speedup, we assume that 
the program is executed optimally on the T3E with four processors. This assumption is quite 
reasonable since experiments indicate that the MAX3D code is run efficiently with a small 
number of processors. With this assumption, we can estimate the execution time on the T3E 
with one processor via multiplying the execution time on the T3E with four processors by four. 
Table 3 shows the speedups of the HPF code after loop transformations and hand parallelization 
using the above assumption. The corresponding data is also shown in From Figure 3. It is clear 
that the code is still quite scalable when the number of processors used is 98. 
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Figure 3: Speedups for Problem Size (73 X 61 X 97) 
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The experiments conducted in this research are limited to a fixed problem size (73 x 61 x 97). 
When the problem size changes, the scalability and efficiency may also change. Theoretically, 
when the problem size is bigger, the ratio of computation over communication is also getting 
bigger. This implies that the communication overhead becomes relatively smaller. It is our 
conjecture that the parallel code will be more scalable and efficient when computing larger 
problems. However, more experiments are needed to confirm our conjecture. 

4 Conclusions 

As we all know, the message passing paradigm has several disadvantages: the cost of producing 
a message passing code may be between 5 and 10 times that of its serial counterpart, the length 
of the code grows significantly, and it is much less readable and less maintainable than the 
sequential version. Most importantly, the code produced using the message passing paradigm 
usually uses much more memory than the corresponding code produced using high level parallel 
languages since a lot of buffer space is needed in the message passing paradigm. For these 
reasons, it is widely agreed that a higher level programming paradigm is essential if parallel 
systems are to be widely adopted. The results obtained in my summer research show that 
the performance of the parallel PFA code is even better than the MPI counterpart besides the 
advantages stated above. However, the current version is implemented in Power Fortran, and 
can only be run on SGI Power Challenger and Origin 2000. The current PFA version also has 
the problem of not scalable when the the number of processors is larger than 16. On the other 
hand, HPF is more powerful than Power Fortran and is available in almost every major parallel 
computer systems. 

HPF has reached a critical stage in its history. Having struggled while the compiler tech¬ 
nology evolved into a usable state, the parallel computing community has now found it possible 
to write portable, high-performance implementations for selected applications in HPF. This re¬ 
search indicates that large scale data-parallel applications such as CEM simulations can use 
HPF to achieve reasonable performance. However, there are a number of pitfalls ahead. No¬ 
table among these are the general drift away from Fortran for advanced apphcations, remaining 
difficulties with HPF optimization technology, inadequacy of support for irregular problems in 
current implementations, the continuing need for advanced mathematical library support, and 
the emergence of competing paradigms such as MPI and PVM. 

This research addresses the portability and scalability problems of the MAX3D code through 
using HPF. Our results achieved during the research have demonstrated that the sequential 
MAX3D code can be parallelized much more quickly using HPF than using MPI, and the doce 
produced this way can stiU be executed efficiently. Further, the code is stiU quite scalable. 
Although the HPF code is running more slowly than the corresponding MPI code and the PFA 
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code on the SGI Origin 2000, the problem wiU be alleviated in the future or running on other 
machines. The problem is partially due to the HPF compiler called PGHPF I used during this 
research. The PGHPF High Performance Fortran compiler supports all HPF 1.1 extensions to 
Fortran 90 and a growing number of HPF 2.0 features [19]. The PGHPF compiler version I 
used is 2.3 and it requires that all arrays be initialized before their usage. It also has some other 
hmitations and bugs which will be eliminated in the future. On the other hand, MPI or PFA 
do not have the initiahzation requirement. This could save a lot of time when a large number 
of big arrays are used in the code. Another possible reason is because processors on the T3E 
has snaU memory and cache space. Hence, the HPF code on one processor is slower than its 
counterparts. Nevertheless, the scalability of the HPF code is quite good and is better than 
that of the PFA code and close to that of the MPI code. As newer version comes out and HPF 
compilers become more mature, we expect that the performance of the HPF MAX3D code will 
be improved substantially in the near future with httle change in the code. 
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1. Summary of Effort and Outcomes 


This SREP extension focuses on the development of a refined methodology to 
determine stress distributions in the presence of local material and geometric 
discontinuities, such as cracks and comers in composite materials. The belief that micro¬ 
mechanical damage initiates or propagates in such regions motivates this research. The 
outcomes of this research extension are as follows: 

• Development and Refinement of the ADM (Variational)-Asymptotic 
combination methodology (Section 2). 

• Determination of the matching radius for the ADM and asymptotic solutions 
and the generalized stress intensity factors (Section 3). 

• Assessment of the effectiveness of the developed solution methodology through 
case studies (Section 4). 

• Enhancement of the ADM software to include asymptotic analysis and the 
ADM-Asymptotic solution matching technique. CDROM of enhanced software 
provided in copy sent to the lab focal point, (Dr. N. J. Pagano, AFRL/MLBC). 

• Continued collaboration with AFRL/MLBC researchers on finite element 
analysis and singular stress fields analysis outside of this project scope — Fiber 
break simulation and cmciform simulation. 

The project effort and outcomes correspond with those anticipated in the proposal. 
These results were presented to the laboratory focal point and a team of researchers on 
two occasions. A mid project meeting was held at AFRL, Wright Patterson Air Force 
Base in June 1998. At this meeting the results of the project were reviewed and further 
work was planned. (Attendees: Drs. Pagano and Tandon of MLBC/AFRL, K. Pochiraju 
and J. Charvet of Stevens). A second meeting was held during the ASME winter 
conference at Anaheim, CA with Drs. Pagano, Tandon, and Pochiraju in attendance. The 
project outcomes were reviewed by the laboratory focal point and the contents of this 
report were presented to the AFRL researchers. Subsequently, the software developed 
under this project was delivered. 

The funds allocated in this project were used for supporting one graduate student 

for two semesters, for institutional overhead , and for travel. The travel included ASME 
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winter conference and to the AFRL, Wright Patterson Air Force Base, for the status 
meeting. 

The results of this project are documented in several technical conferences. 
Journal articles based on this work are currently prepared for submission. The 
Appendices A and B contain the presentation material and the manuscript being prepared 
for journal submission. 


2. The Hybrid-Variational Asymptotic Method _ 

In this section a technique for embedding the asymptotic solutions within selected 
regions of the variational model [1-5] is described. This technique modifies the 
variational solution while ensuring that the global equilibrium, boundary conditions and 
the constitutive relationships are satisfied. Stress continuity between the asymptotic and 
variational solution at one selected point inside the domain is matched to determine the 
unknown coefficients in both ADM and variational models. Appendix - A provides a 
detailed report with the background on the development of the model and the 
formulations. 

Consider an axisymmetric model of fiber pullout specimens as shown in Figure 1. 
In obtaining the refined solution, a region of radius p*, is identified around the singular 
point (comer-0 in Figure 2). In this region the asymptotic solution [6,7] determined by 
solving the local homogenous problem is taken to dominate and the variational solution 
in this region is ignored. The local asymptotic solution, however, can only be determined 
with unknown scale coefficient(s). This scale factor needs to be determined using a 
global solution. 

Since a small region around the singular point is excluded from the variational 
solution, sufficient boundary conditions do not exist for solving the global boundary 
value problems. Additional equations are required to obtain the undetermined coefficients 
in the variational formulation. In this methodology, these additional equations are 
obtained by matching the asymptotic and variational solutions at one point near the point 
of singularity. 
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Figure 1: Concentric Cylinder Model - Global axisymmetric and local asymptotic 

problems 


ADM 

solution 
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This point on the boundary between the variational and asymptotic solution is 
identified with radial and axial coordinates, p*, 6*, respectively. At this point, conditions 
are imposed such that the normal, shear and radial stresses are continuous between the 
variational solution and the asymptotic analysis. If there are 'M' conditions required for 
the variational solution (thus determining M undetermined constants), and a "N' term 
expansion of the asymptotic solution is considered, then N+M independent matching 
conditions are required to determine both the local and global solutions. For example in 
Problem-A, there are three boundary conditions associated with the singular point in the 

variational formulation. They are the axial stress in the fiber (Gzz), the axial stress in the 

matrix (CJzz"’), the shear stress (CJrz, continuous on fiber and matrix side) vanish. These 
conditions determine three unknown coefficients in the variational solution. If a one term 
asymptotic solution is considered, then one undetermined scale constant needs to be 
obtained by matching. Thus there are four unknown coefficients requiring four matching 
conditions. Following four conditions can be written at the matching radius, p*, 
following conditions at R=Rf, Z= Z*(Z* 0): 

azz (ADM) = azz (ASYMP) in fiber 
azz (ADM) = cTzz (ASYMP) in matrix 
grz (ADM) = aRz (ASYMP) 
grr (ADM) = aRR (ASYMP) (1) 

First issue addressed is the determination of the size of the local region where the 
asymptotic solution prevails. In the current parameterization of the problem, this 
translates to determining the value for the radius, p*. An error-function [7] is employed 
to obtain the difference in stress fields predicted by asymptotic and variational solutions 
at several radii. Using such an error-basis, an optimal radius for embedding the 
asymptotic solution within the variational solution is searched. 

The error fimction used to quantify the extent of angular variation matching at 
radius, p* is given in eq.(2). The scale factor (K) in the error computation is obtained 
using a single point matching as given in eq. (3). 
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The procedures for determining the stress intensity factors (or scale factors for the 
comer case) is be established. This aspect of the methodology is closely coupled with the 
first issue of determining the region of dominance for the asymptotic solution. As these 
scale / stress intensity factors relate the local solution to the global solution of the 
problem, the determination of the scale factors is best accomplished at the matching 
point. The selection of a component of the stress field, such as the normal stress 
component shown in Eq.(3), that most accurately estimates the value of the scale factor is 
investigated. 

( o' V®" 

) (3 

In the case of ADM, results indicate that the normal and shear stresses match well 
closer to the singular point (p* < lE-4 mm) and the radial stress which is discontinuous 
across the interface matches well away from the singular point ( p* > lE-2 mm). As 


explained in the SEP report submitted in November 1997, this is due to the presence of 
the boxmdary condition in the variational solution requiring the radial stress at this 
boundary to vanish. Thus, the use of the matching technique for the ADM solution is not 
as straight forward as compared to the FEM solution. In this effort, the error between the 
ADM and the asymptotic solution and the angular variation matching is analyzed. This 
analysis identifies that a region of matching where the stress intensity ( or scale) factors 
can be determined accurately. The resuts section shows a numerical example illustrating 


the procedure. 

The issue pertaining to the changes in the interpolation of stresses obtained by the 
variational solution due to the embedded asymptotic solution is also investigated. 
Currently the radial interpolation of the stresses within the variational solution is carried 
out for different components of stress based on the assumed form (the assumed form 

shown for Gzz and a^,^, stress components in Eq. (4)) or the form derived from 
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equilibrium equations [4] . Figure 2 shows an ADM solution region with radial bounds at 
Ri and R 2 and with asymptotic solution embedded around the point of singularity, O. At 
heights Z > p* of this region, the unmodified axial and hoop stresses along the radial line 
can be interpolated using the functions shown in Eq. (4). The presence of the asymptotic 
region modifies the state of stress in the neighborhood (Z < p*) of the embedded region. 
When Z < p*, appropriate modifications to Eq. (4) are necessary. 


^zz ^zz (-^ 1 ) 


(R 2 -R) 

(R,-R,) 


^ZZ (^2 ) 


(R,-R,) 


(R2-R) 

(R 2 -R,) 


^ee (-^ 2 ) 


{R-R^) 

{R,-R,) 


(4) 


As asymptotic solution replaces the variational solution in the neighborhood of the 
singularity, the interpolation functions should be appropriately modified. This 
modification is necessary to determine the angular distribution of stresses around the 
singular points and to compute the matching error. The modified interpolation scheme, 
which includes the asymptotic stress fields within appropriate regions, is formulated as 
given in eq.(5). Using interpolating functions that utilize the asymptotic solution at the 
boundary of both the regions, (R 3 , Z) in Figure 2, are considered. With this assumption, 
the interpolating functions are written as shown in Eq. (5). 


^ZZ iR)~ ^ZZ ^R I ) 

iR}~ ^ee ^R 1 ) 


jR^-R ) 

iR2-R0 

iR^-Rs) 


, ASYMP , D \ 
+ 0-ZZ (^3) 


, ASYMP.n \ 

+ <^ee iRi)- 


{R-R2) 

(^-^ 3 ) 

{R,-RO 


(5) 


3. Software Implementation Details _ 

Two major enhancements to the ADM software are made. First is the inclusion of Airy- 
Stress function based asymptotic solution in the ADM variational environment. Second 
is the modification of the variational equilibrium solution to include the matching 
conditions shovm in eq. (1). Figure 3 shows the two major modifications to the ADM 
code to implement the functionality described in this document. 
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Several comparatively minor enhancements were made to the code to enable 
post-processing of the stresses: 

• Angular variation around points of singularity: In order to obtain the stress 
distributions around arbitrary points in an ADM model, a procedure was 
implemented to obtain stress at any point in the model. This requires 
identification of the section and layer from which the stresses need to be 
computed for that location and construction of the solution. This modification 
allows querying any location in the ADM model to obtain stress components 
in global axisymmetric and local plane strain model. 

• Logarithmic radial variation: The capability to print stress fields along 
arbitrary directions from any point in the model was implemented. The steps 
for traversal in the radial directions can be both linear and logarithmic. This 
capability is crucial in examining stress fields at very close distances around 
the singular points. 

During the project period another two substantial enhancements were attempted. First 
was the determination of displacement fields based on the stress fields. This issue 
remains open. The second is the implementation of higher order particular solutions. 
This enhancement was partially completed. These two enhancements are considered to be 
beyond the scope of the present project effort and no additional effort was made during 
the project period. However, the code reflects the partial advances made in these two 
directions. 
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Boundary Condition Dependent 
Coefficients for Variational Solution 


Scale Factor for Asymptotic Solution 

Figure 3: Schematic of additions and modifications made to the ADM Software 
(Grey boxes indicate added functionality) 


4. Results AND Verification _ 

Using example problem-A, the original and modified solutions are compared. An 
arbitrary matching distance of p* = lE-04 mm is chosen for this example. This problem 
has a fixed mixity of normal and shear stresses and hence has one undetermined scaling 
constant for the local asymptotic solution. Three equations are replaced with the 
matching equations given in Eq. (1). The fourth relation in Eq. (1) is vvritten as an 
additional equation into the variational equilibrium system. This combined system of 
global boundary conditions and matching conditions are solved simultaneously to 
determine the unknown coefficients in the ADM solution and the scale factor of the local 
solution. 
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Figure 4-Figure 7 show the axial variation of the original and modified solution 
plotted along the fiber-matrix interface for several stress components. It can be seen that 
the solution away from the local singular point is unaffected and the solution in the 
singular region corrects towards correct asymptotic behavior. This plot also shows that 
the matching distance from the variational solution perspective should be larger as the 
influence of the boundary condition (as seen as a local maximum near the singular point 
in the axial and shear stresses) seem to be of the order of p = 0.1 mm. 



Figure 4: Original and modified solutions for axial stress (Fiber-Side) along the interface 
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Figure 5: Original and modified solutions for axial stress (Matrix-Side) along the interface 



Figure 6: Original and modified solutions for radial stress along the interface 
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Figure 7: Original and modified solutions for shear stress along the interface 

Figures 4-7 also show that the modified solution and the original ADM solutions 
are identical away from the point of the singularity. Thus the modification made only 
disturbs the solution locally. 

The angular variation of the original solution was then compared with that obtained 
form the asymptotic method. Figures 8-11 show the angular variation matching at 0.0001, 
0.001, 0.002, and 0.02 mm away from the singular point. The variational solution is seen 
to capture two of the three stress components within the meridian plane, namely the 

opening stress component (Grr) and the shear stress component (Gr/) close to the 

singular point. The axial stress component (Gzz), however, can only be matched away 
from the comer or singular points due to the influence of the boundary condition 
coincident with the singular point. This behavior is further investigated using the error 
function. 
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Figure 8: Angular variation matching with ADM solution at p*=lE-4 mm 



Figure 9: Angular variation matching with ADM solution at p*=lE-3 mm 
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Figure 11: Angular variation matching with ADM solution at p*-lE-2 mm 
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The error as defined in eq.(2) was computed for the variational solution with the 
matching distance. Figure 12 shows that the error in the angular variation between the 
asymptotic and variational solution increases of the matching radius is increases beyond 
4.5 pm or reduces below that value. This distance is apparently optimal where the first 
term in the asymptotic solution is dominant and the radial stress component shows the 
discontinuity across the interface. 

Figure 13 and Figure 14 show the variation of the normal stress and shear stress 
as determined by the ADM solution near the singular point. The precision of the 
computational solution is course that no variation of the interface normal stress 
component(CT 0 e) is seen after a distance of 10'^ mm. Therefore, a limit on the distance of 
matching is also imposed by the size of the layering in the ADM methodology. 
Comparison with the FEM solution shows that the best possible angular variation 
matching for this case is obtained at a distance of p* = 8.8E-9 mm. The angular 
variations of the asymptotic and FEM solutions match accurately around the point of 
singularity at this distance. Note that the error in K value grows to 10% at p* = 3E-7, and 
to 44% at p* = 6.6E-4. Therefore, more refined variational solutions may be warranted to 
obtain the scale factors accurately using the matching technique. The modified 
variational-asymptotic solution determines the scaling factor closer to that of the FEM 
solution. Figure 13 shows that the hybrid variational asymptotic solution is very sensitive 
to the matching distance in the region lE-3 < p* < lE-2. 

The conclusion that emerges from this investigation is that the optimal matching 
distance for obtaining the stress intensity and scale factors for variational solutions is 
farther away from the singular points than those in the FEM solutions. There is a very 
small region in which this comparison can be effectively made. 

Table 1: Variation of Error and Scale Factor K for FEM solution 


1 Matching Radius 

K 

Error 

% Change in K I 

8.80E-09 

0.019731026 

0.003984548 

0.00 

9.65E-09 

0.019375432 

0.004461569 

1.80 

1.39E-08 

0.019208927 

4.51 E-03 

2.65 

3.05E-08 

0.018879918 

4.98E-03 

4.31 

6.63E-08 

0.018528034 

5.48E-03 

6.10 

3.05E-07 

0.017748027 

6.81 E-03 

10.05 

1.39E-05 

0.015088143 

1.22E-02 

23.53 

0.000663204 

0.010926935 

2.62E-02 

44.62 
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5. Remarks 


The hybrid variational-asymptotic solution has the potential to be a promising 
methodology for obtaining stress intensity factors from global variational solutions. 
However, several challenges emerged during the project period. A conclusion emerges 
from this work that the region of matching the local and global solutions is further away 
from the singular point than that in the FEM. A matching distance where the asymptotic 
and variational solutions have similar jump at the interface and the hybrid-variational 
asymptotic method can be used to obtain fairly accurate stress intensity factors. 

Other alternative solutions methods were investigated during the project period. 
These alternatives and the outcomes of the investigations are briefly sununarized in this 
section. 

• Formulation of variational solution with power-type of radial terms: The 
problem for the formulation arises out of the differences in the local and global 
solutions. While the global axisymmetric solution is obtained by functions 
separable in R and Z coordinates, the local solution is separated as functions of 
p and 0. This leads algebraic complexity that seems insurmountable at this 
point. 

• Applicability of Morley’s Method: Considerable effort was made to device a 
method for applying Morley-type of method for embedding radial power-type 
of terms into the current variational formulation. Morley’s method is best 
applied when the local disturbance can be expressed in terms of a common set 
of basis functions. It is not obvious to this investigator as to how this can be 
extended to the situation when a local field is known in terms of one coordinate 
system and function form and the global field is in a completely different 
coordinate system and functional form. There may be some difficulties due to 
the fact that the asymptotic solution used is a plane-strain formulation rather 
than axisjmimetric in nature. 

• The displacement continuity, higher order particular solutions, and the more 
refined layering schemes are refinements that can lead to accurate solutions near 
the points of singularity. 
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Stress (MPa) Scale Factor (K) 



Figure 13: Variation of normal and shear stress before and after modification 
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Figure 14: Variation of Axisymmetric Stress Components along the Interface (0-90,0 < p* < Rf) 
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Appendix - A: SFP Report - Refined Reissner’s Variational 
Solution in the Vicinity of Stress Singularities 


Introduction 

The ability to predict damage initiation and groAvth in composite structures relies on 
accurate stress solutions and simulation of micro-mechanical damage processes. Pagano [1] 
applied the approximate elasticity model based on Reissner's variational formulation [2] for the 
prediction of micro-mechanical failure modes in brittle matrix composites. Tandon and Pagano 
[3] recently investigated the failure modes and stress states of a single fiber pullout model. The 
solution technique, based on Reissner's variational formulation, entails dividing the structure into 
regions, assuming the distribution of some components of a stress field within a region, 
determining the spatial structure of the other components that satisfy the equilibrium and 
boundary conditions, and inter-region continuity conditions. Pagano [4], Kurtz and Pagano [5], 
and Brown [6] describe the extension of this technique to axisymmetric concentric cylinder 
model. 
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The single fiber axisymmetric model was also investigated earlier by Pochiraju [7] using 
a local asymptotic and a global finite element analysis. This effort employs a local-global 
matching method to effectively determine the detail spatial structure and magnitude of the locally 
singular stress field as well as the complete global stress distribution in the single fiber concentric 
cylinder model. The local-global matching method consists of three components: a local analysis, 
a global analysis, and proper matching of the local asymptotic field to the global complete stress 
field. (Pochiraju, Lau, Wang [8]). The local regions are identified as the areas where the 
geometiy and the presence of interface leads to stress accentuation and possibly a singularity. In 
such local regions, asymptotic solutions are obtained for several fiber-matrix interface 
conditions. In order to determine the effects of applied thermal residual loading, mechanical 
pulling and boundary conditions on the local fields, a global finite element analysis is performed 
with mesh refinement in the local regions. No singularity information is embedded into the finite 
element analysis. Matching the angular variation and radial variation of stress fields obtained 
from the finite element analysis and with those obtained from asymptotic analysis determines the 
nature of the local stress fields in the local regions. The method as developed is applicable for 
various fiber-matrix interface conditions: namely perfectly bonded interface, partially debonded 
interface with interfacial crack, or debonded interface with frictional interface sliding [9]. 

The motivation to solve for the local stress field is that these accentuated stresses, strains 
and energies are likely to induce damage. The research presented in this report is motivated to 
include the characteristics of asymptotically singular stress fields within the Reissner’s variational 
solution. There are several significant differences in the nature of the variational and asymptotic 
solutions and are listed below: 

• The variational solution solves the axisymmetric equilibrium, boundaiy conditions and 

the constitutive relationships. Separable solutions with power type of radial 
singularities exist for plane strain problems. However, Zak [10] showed, that through 
the transformation of coordinates from global to local systems, that the axisymmetric 
solution degenerates (curvature terms become insignificant) to the plane strain 
solutions in small local regions around a point. 

• The variational solution for axisymmetric problems is separable in the axial ( Z 
) and radial ( R ) directions. The asymptotic stress fields are the solutions 
separable in the polar coordinate system (p, 0) with the origin at the point of 
singularity. 

• The variational solution is determined by assuming the radial ( R ) distribution of 

two of the four stress components and determining the functional form of the 
other components by the minimization of the potential energy. This is a 
complete solution and satisfies the equilibrium and boundary conditions. The 
asymptotic solution is a partial solution and consists of undetermined 
coefficients. These coefficients may be obtained from loading and boundary 
conditions or by comparison with a global complete solution. 

The variational formulation, as implemented in the Axisymmetric Damage Model, 
assumes a linear radial variation of the axial and hoop stresses within any solution 
element (Layer) and the axial variation is determined as a solution to a first order 
differential equation. It may be conceptualized that an assumption of stress fields of the 
type given in Eq. (1) for M singular fields and with N terms/field expansion of 
asymptotic solution for two stress components and deriving the nature of the other 
components using the variational technique, but such an effort would be algebraically 
tedious, requiring lengthy development for concept validation. _ 
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Methods for enhancing the solutions near bi-material cracks and comers by selectively 
using the asymptotic solution are investigated. 

Comparison of Asymptotic and Variational Solutions 


In this section, the asymptotic and axisymmetric variational solutions are compared 
for several crack and comer problems. Figure 1 shows schematics of the concentric 
cylinder model along with the R-Z meridian plane for the axisymmetric problem, and the 
local solution for a typical fiber pullout problem. A local region is shown in the detail-A 
which surrounds the point of singularity, O. First a short description of the asymptotic 
solutions in the local region along with a method of matching the local solution with a 
global numerical solution is presented. The variational solution is summarized in the 
second sub-section. An algorithm for determining the angular variation of stresses around 
the singular point from the variational solution is presented. A series of problems 
involving cracks and comer singularities are investigated. A comparison angular 
variation of the stress distributions obtained from the variational solution and asymptotic 
analyses is presented. This comparison is conducted at a selected radius where the 
leading singular term in the asymptotic analysis is believed to be dominant. A second 
global solution from Finite Element Method (FEM) is also presented for reference. 

Local Asymptotic Solution 

The point, O, is the location of material and geometric discontinuity. The local 
stresses at O may be singular in nature depending upon the fiber and matrix properties 
and the fiber-matrix interface conditions. In order to determine the spatial distributions 
of the stress fields in the local region, the governing equations are formulated for 
axisymmetric stresses in {R, Z, O} coordinates and then transformed to the local {p, 0} 
coordinates, retaining only terms valid as p -> 0 . Stress fields separable in the p & 

0 coordinates and with radial power-type of singularities are considered. Eq. (2) shows 
the function form of stress components near the point of singularity as determined from 
the asymptotic analysis. 

(^ij= kp^^ij(P) (?^ < 0; p 0) (2) 

The angular distribution functions, Oy {6) ,for the matrix-domain and fiber-domain 

are determined from the Williamsfl 1] type of stress functions. Eq. (3) shows the typical 
stress function required to derive the stress components for axisymmetric problem. 

(p{9) = A cos(Z + 3)^ + B sin(Z + 3)^ + C cos(/l)^ -t- D sin(Z)0 
(3) 

The traction free boundary conditions at 0=0 and 0=37i/2, and the perfectly-bonded 
fiber-matrix interface conditions at 0=7r/2, form a system of eight homogeneous 
equations ([7]): 


[Qj(?.)]A = 0 


( 4 ) 
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where X is the eigenvalue, and A -[A^, D™, A^, C^, is its 

associated eigenvector. For nontrivial solution, the determinant of IC mn C^)] niust vamsh. 
The eigenpair {>,, A} determined from Eq. (4) provides the p- and 0-dependent structure, 
and ajjCe), of the singular field. A determinant search algorithm was used to find the 
eigenvalue X within the admissible range of 0 > A->-l. For some fiber-matrix 
combinations, there can be more than one solution for X in the admissible range. We are 
concerned with the smallest admissible X, which gives rise to the most dominant singular 

stresses. The G-dependence of the singular field, 5^(0), is determinable from solution of 

the eigenvector A to within an arbitrary constant. We normalized aij(0) by choosing the 
arbitrary constant so that (6 ) = 1» where 0* is the angular position of points on the 
interface. 

The amplitude factor k of Eq. (2) is to be determined from the full-field solution. 
This is accomplished by appropriately matching the singular field with the full-field 
stress distribution provided by an analyses methods such as FEM or other variational 
methods. This matching was carried out earlier [8,9] using the FEM. A displacement- 
based axisymmetric finite element model, as shown in Figure 16, with quadratic 
Lagrange elements was used to compute the global full-field solution. Near the fiber- 
protrusion point O, elements were arranged in concentric rings with boundaries given by: 
(p/Ro)n = 10’^ {1+ (100-2)n-l} where n denotes the n-th ring. The first ring encircling 

the fiber-protrusion point consisted of quadratic triangular elements degenerated from 
Lagrange quadrilateral elements which formed the rest of the mesh. No special 
implementation for singularity was imposed on these elements. Within the finite element 

spatial domain, we searched for a sub-region where the full-field stresses, ajjfr.G) FEM^ 
have the identical 0-dependence as that of the singular solution, 5^(0) . If the finite 


element stresses match 0^(0), then they also match the eigenvector A and its associated 
eigenvalue X. One may choose any stress component, say qqq , at any angle, say 0*, to 
determine k: 


,FEM 


k = ^ 


(p,o^y 
^¥1 




(5) 


However, starting from the second ring of elements, results show good matching 
characteristics typical of Figure 16. It is expected that as p/R© increases, the one-term 


singular solution will deviate from the full-field solution, resulting in less spectacular 
angular matching at sampling points with large p/Ro- ^ root-mean-square error 


parameter, E, was developed to quantify mismatching along any arc of constant radius p. 
Finite element results were sampled at N points of varying 0-value along the arc. At each 


sampling point p (at 0p), the average of the squares of the mismatch for three stress 
components. The expression for the error measure is shown in Eq.(6). E is zero when 
there is a perfect match. K is determined from the regions where E is minimum. 
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Variational Solution 

The axisymmetric damage model was developed in order to approximate the elastic 
stress field and energy release rates of bodies in the form of concentric cylinders and 
contains damaged regions in the form of annular or penny-shaped cracks in the 
constituents and/or debonds between them. The model is generated by subdividing the 
body into regions consisting of a core and a number of shells of constant length and 
satisfying the Reissner variational equation with an assumed stress field in each region. 
That is, we set 


o 

II 

(7) 

J=^FdV- jf,U,ds 

(8) 

K S' 



(9) 


In these equations, W is the complementary energy, Tij and Ui are the stress and 
displacement components, respectively, in Cartesian coordinates, eij are the mathematical 

free expansional or non-mechanical strains, V is the volume enclosed by S, S' is the part 
of S on which one or more traction components are prescribed and T are the Cartesian 
components of the prescribed tractions. Further, a comma followed by a subscript(s) 
implies differentiation with respect to the appropriate coordinate(s) and the summation 
convention is understood. 

Reissner (1950) has shown that the governing equations of elasticity can be 
obtained as a consequence of the variational equation provided both stresses and 
displacements are subject to variation in the application of (7). The stress field is assumed 
such that azz and aQQ are linear in R within each region, while the forms of and cr^ 
are chosen to satisfy the axisynunetric equilibrium equations of linear elasticity. Letting, 

~ ^RR’^] ~^RZ ( 10 ) 

(and the analogous relation for the engineering strain components si (i=l, 2, 3, 5) and 
hygrothermal free expansional strain components ei (i = 1, 2 3)), we arrive at the relations 
in the region R, < R< Rj'. 

( i=L2,3,5;J=l,2,...5) (11) 

where are known shape functions of R defined such that/?^„ = a‘i{R^,Z){ i = 1,2,3,5; 

J=l,2,...5) (12) 

In order to maintain consistency with our postulated stress field, we further assume 
that eqs. (11) hold on the boundaries, as well as within the medium itself The remaining 
dependent variables follow directly from the mathematics without further assumptions. 
They are the weighted displacements: 
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( 13 ) 


(u,u*,u,u)= j^'u(\,r,r^ ,r^)dr 
(w*,w) = w(r,r^)dr 


(14) 


The interfacial displacements Ua, wa (a = 1, 2), or displacements on the radial 
boundaries R = ri, r2, only enter the formulation if they are prescribed or if we have 
relative slip with friction at the interface. For a body composed of a core region plus N 
shells, the formulation leads to the solution of 18 N + 16 algebraic and ordinary 
differential equations in Z. On planes, Z=constant, the weighted displacement or traction 
quantities are specified in certain pairs. Furthermore, continuity conditions can be written 
for the surfaces R = constant which are internal to the medium. Finally, the boundary 
conditions on z = zi, z 2 , or end conditions, can be expressed by prescribing one term 
from each of the following products The regions are selected such that the thermoelastic 
properties are constant and the boundary conditions do not change character on any of the 
bounding surfaces (i.e., traction and/or displacement components prescribed or continuity 
prescribed) within each region. The number of regions, in particular in the r direction, can 
be increased in order to improve the solution accuracy. 

Since the field equations within each material are linear differential equations with 
constant coefficients, the general form of the solution for any of the dependent variables 
P(z) is expressed by: 

P(Z)=P,(Z) + Y^Ae^‘^ +P^(Z) (15) 

i 

within each layer where Ai are constants, X,t are eigenvalues of a determinant, Ph(z) is the 
polynomial type of homogenous solution (repeated roots), and Pp(z) is a particular 
solution, which in the present case is a simple polynomial. This completes the boundary 
value problem formulation with the variational model. 

Angular variation determination from variational solution 
To tabulate the stress fields in the ADM solution in a region, following algorithm is used. 

1. Given the point of singularity, Ro and Zo 

2. Given p, generate a series of0 values from 0| start to 02 at A0 increment 

3. For every p and 0: 

4. Determine the global coordinate R and Z 

R = Ro + p cos(0) 

Z = Zo - p sin (0) 

5. Determine the section number, isec, based on the Z value 

6. Determine the section properties for isec 

7. Determine the layer number, ilayer, based on the R value 

8. Do a transformation of Z to local coordinate, Ziocai of the section and layer. 

9. Construct the solution for the unknown quantities [4] for that layer at Z Zlocal 

10. Store the solution quantities and their derivatives 

11. Do radial interpolation using the appropriate equations for core and coating: 

For core: 

o-zz o-zz ('• 
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(y _^ \ 

^Rzir) =<yRzir^)- - \ + -r2)r 

(^1-^2) 

o’mW = + (t' (r^ -r/)r + cr"(r' -r^ )r 

(n-^2) ('•i-'- 2 ) 


For Coating: 


(r, - r,) (f - f,) 

<^zz('-)=o-zz(''i)^;^7^+<^z2(''2)^;p:^ 


(0 = <^RR ('' 1 ) 7 ^—^ (^ 2 )- - + O’!, (f' - r^){r" - n+ r,r, (r, + r ,)) 

('■i-'' 2 ) (''i-'' 2 ) 


=o-Rz(''i)- + «^Rz(—) 


/^^(^-^l) ,^/ J/. _(r, +r,r 2 +r 2 )r^l 


+<A(ri+f2y -■ 


'“> 2 '(r,-r 2 ) r 

where o', o", and o'" correspond to the solution quantities as given in Eq. (11) [4]. 

12. Increment 0 by A9 

13. Check if 0 = 02, YES: Stop, NO: goto 3 

Table 2: Material properties and power of singularitv for conFiEuration in Problem-A 


Material 

Ef 

Vf 

Em 

Polyester/Epoxy 

3.25 GPa 

0.35 

4.0 GPa 

Silicon Carbide/ Borosilicate 

410 GPa 

0.2 

63 GPa 



0.00463 


0.11206 


Comparison of ADM and Asymptotic Solutions 

The algorithm presented in the previous section was implemented into the current 
axisymetric damage model. The results presented in this section are for unmodified 
variational solution. Several problem involving crack and comer geometry were then 
investigated to determine the radial and angular distribution of the stresses predicted by 
the variational model. In this report, the following four case studies are presented. 

Problem A: Comer singularity in the single fiber push-out model: Figure 17 shows 

the geometry of the axisymmetric concentric cylinder model typically 
employed to simulate fiber push-in experiments. The singularity at the ends of 
the model is material dependent is given for two material combinations in 
Table 1. 

Problem B: The problem of an interface crack with the debonded faces sliding with 

fiiction is considered. The local coordinate system at the crack tip and the 
geometry of the model is shown in Figure 18. The fiber and matrix materials 
are taken to be Polyester and Epoxy, respectively. The coefficient of friction is 
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assumed to be 0.52. These values correspond to recent push-in tests conducted 
by Bechel [12]. The power of the stress singularity for this case is -0.49605. 
Problem C: The penny shaped crack in a fiber is considered as the third example. 

For this case the stress singularity is -0.5 and is material independent. The 
mode-1 loading case is considered and for this case the exact stress intensity 
factor for the infinite body is also known. Figure 19 shows the local 
coordinate system. 

Problem D: The problem of a bi-material crack with open, traction-free, crack faces 

is considered. This is the case where the stress singularity is complex-valued. 
(A,=-0.5 + 0.0076 i) Again polyester-epoxy system is considered. The local 
coordinate system is shown in Figure 20. 

The comparison of angular variation for problem A reveals the nature of the 
variational solution around the comer singularities. The asympotic behavior of the 
opening stress (aee) and the shear stress (Ope) are captured by the variational solution. 
Figure 21 and Figure 22 show the comparison of the variational and asympotic solution 
for problem A. The comparison is conducted at a distance of 1 pm from the singular 
point and the variational solution shows a good correlation with the field obtained from 
asymptotic analysis. Figure 23 shows the angular distribution of the radial stress 
(corresponds to the axial stress azz in the axisymmetric coordinate system) for 
asymptotic and variational solutions. The variation of the radial stress (on-) significantly 
differs from the asymptotic behavior. The variational solution tends to a zero value at the 
interface consistent with the boundary condition imposed at that point in the solution. 
Figure 24 shows the radial variation of the axial stress component (ozz) in the fiber and 
matrix. This plot compares the stress distributions obtained from the FEM and variational 
solutions. There is a good correlation of the stress distributions between the two solutions 
in regions away from the singular point (at z = 0). This behavior is due to the boundary 
conditions imposed in the variational solution which require that the axial stress 
component (azz) should vanish in both fiber and matrix at z = 0. 

Problem-B is the frictional crack problem. The angular variation matching is 
conducted comparing asymptotic (Comninou-type of crack [13]), FEM and variational 
solutions. Again the opening stress (aee) and the shear stress (ape) are compared. The 
comparison shows additional characteristics of the variational solution. The opening 
stress. Figure 25, shows a good match between all the three cases though the FEM 
analysis is an incremental analysis and the variational and asymptotic analyses are 
independent of the load history. The shear stress. Figure 26 , shows that the overall 
distribution of the stress component in the vicinity of the stress singularity is comparable, 
the discontinuities in the shear stress component in the ADM solutions correspond to the 
jump between the sections where shear stress continuity is not imposed. 

The penny shaped crack in a polyester fiber is considered. The applied loading 
was pure mode-I and the exact stress intensity factor is known. Figure 27 shows the 
comparison of the opening stresses between asymptotic and ADM solutions. There is a 
good correlation between the ADM and asymptotic solutions. The shear stress angular 
distribution is shown in Figure 28. Once again the jump across the sections in the 
variational model causes the discontinuity of the shear stress. The results of the angular 
variation comparisons are presented for the bi-material interface crack problem in 
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Figures Figure 29 and Figure 30. The variational solution captures the opening and shear 
stress distributions as predicted by the asymptotic solution for this case also. 

Stress intensity (Scale) factors can be obtained by single point matching method 
as described in the previous sub-section. However, since an error basis was not developed 
for matching using the variational solution, the radius and the component of matching is 
arbitrary and may be subjected to a large error. Therefore, the work on stress intensity 
factor determination is not presented here. 

Refining Variational Solution with Asymptotics 

In this section a technique for embedding the asymptotic solutions within selected 
regions of the model is described. This technique modifies the variational solution while 
ensuring that the global equilibrium, boundary conditions and the constitutive 
relationships are satisfied and maintaining stress continuity between the asymptotic and 
variational solution at a selected point inside the domain. 

Modified Variational Solution 

Consider the axisymmetric model, corresponding to Problem-A, as shown in Figure 
31. In obtaining the refined solution, a region, of radius p*, is identified around the 
singular point and in this region the asymptotic solution determined by solving the local 
homogenous problem prevails and the obtained variational solution is ignored. The local 
asymptotic solution however is determined with unknown scale coefficient(s) which 
depend upon the global solution. This scale factor needs to be determined using a global 
solution. 

Since a small region around the singular point is excluded in the variational 
solution, several boundary conditions are also excluded in this region. This requires 
additional equations so that the undetermined coefficients in the variational formulation 
can be computed. These additional equations are obtained by matching the asymptotic 
and variational solutions at selected radius 

A point on the boundary between the variational and asymptotic solution is 
identified (p*, 9*). At this point continuity conditions are imposed such that the normal, 
shear and radial stresses are continuous between the variational solution and the 
asymptotic analysis. If there are 'M' boundary conditions associated with the singular 
point in the variational solution (thus determining M undetermined constants), and a 'N' 
term expansion of the asymptotic solution is considered, then N+M independent 
matching conditions are required to determine both the local and global solutions. For 
example in Problem-A, there are three boundary conditions associated with the singular 
point in the variational formulation. They are the axial stress in the fiber {aj), the axial 
stress in the matrix (azz""), the shear stress (ctrz, continuous on fiber and matrix side) 
vanish. These conditions determine three unknown coefficients in the variational 
solution. If a one term asymptotic solution is considered, then one undetermined scale 
constant needs to be obtained by matching. Thus there are four unknown coefficients 
requiring four matching conditions. Following four conditions can be written at the 
matching radius, r*. Replace these conditions at R=Rf, Z= 0 by following conditions at 
R=Rf, Z= Z* 

@ R==Rf & Z=Z* (Z* ^ 0 ) 

_ azz (ADM) = azz (ASYMP) in fiber _ 
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azz (ADM) = CTzz (ASYMP) in matrix 
aRz (ADM) = aRz (ASYMP) 

Crr (ADM) = CTrr (ASYMP) (16) 

Software Implementation 

The current ADM software implementation was modified to include the asymptotic 
behavior. Following enhancements were made to the software to include the asymptotic 
analysis and the refinement to the solution by embedding the asymptotic solution within 
the variational solution. 

• A framework for handling several singular points vrithin a model was 

implemented. Each of the singular points can have different powers of 
singularities and angular variations. This enables the solution modifications 
described in the previous section to be applied at several spatial locations 
simultaneously. 

• A local homogenous problem can be described so that power of the stress 
singularity and the angular distributions can be computed within the ADM 
solution. The user identifies the point of singularity and describes the local 
homogenous boundary conditions. The eigenvalue and eigenvectors for the 
local problems are computed from within the software. 

• The computation of the unknown coefficients of the solution (Eq. 15) is 
modified so that the boundary conditions corresponding to the singular point are 
removed and the equations corresponding to the matching conditions shown in 
Eq. (16 ) are formulated. The solution of the new system of the equilibrium 
equations is carried out simultaneously determining the unknown coefficients 
in the variational solution as well as the scale factors for the asymptotic 
fields. 

• A post-processing enhancements to determine the modified solution by 
considering the presence of the asymptotic solution within a small region is also 
developed. 


Table 3 : Asymptotic Solution 


POWER OF 

SINGULARITY: A, = -.1121 

COEFFICIENTS OF ANGULAR VARIATION 

1 

1.0000 (Arbitrarily Set) 

2 

1.8428 

3 

-1.3832 

4 

-1.3019 

5 

4.9462 

6 

1.0667 

7 

-6.2097 

8 

0.2440 

Mixity 

= .1567 


* ^ ^ --II 

Stress 

Original (MPa) 

Modified (MPa) 

^zz 

-0.00002 

-0.00835 


0.01834 

0.01345 
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^RR 

0.03112 

0.02339 

^RZ 

0.00085 

0.00315 


Characteristics of the refined solution 

Using example problem-A, the original and modified solutions are compared. An 
arbitrary matching distance of p* = lE-04 mm is chosen for this example. The results of 
the local asymptotic solution is given in Table 2. This problem has a fixed mixity of 
normal and shear stresses and hence has one undetermined scaling constant for the local 
asymptotic solution. 

A eleven layer variational model has 86 coefficients determined through the global 
equilibrium solution. In this system, three relations corresponding to the boundary 
conditions at the point of singularity are identified in the software as equations 18, 20 
and 21. Following the method presented earlier, three equations are replaced with the 
matching equations given in Eq. (16). The fourth relation in Eq. (16) is written as an 
additional equation (87*^) into the equilibrium solution. The variational equilibrium 
solution is solved along with the additional equation that determines the scale factor. 
Table 3 presents the original and modified state of stress at the matching point 
(p*,0=9O). 

Figure 32 shows the axial variation of the original and modified solution plotted 
along the fiber-matrix interface. It can be seen that the solution away from the local 
singular point is unaffected and the solution in the singular region corrects towards 
correct asymptotic behavior. This plot also shows that the matching distance from the 
variational solution perspective should be larger as the influence of the boundary 
condition (as seen as a local maximum near the singular point in the axial and shear 
stresses) seem to be of the order of p = 0.1 mm. The appropriate distance for the 
matching is currently being investigated. 

Summary 

In this summer project period, the characteristics of the variational solution in the 
vicinity of a stress singularity are investigated. Though the variational formulation does 
not contain any features of the asymptotic solution, the solution reflects the asymptotic 
nature in two of the three components. Effort was also focused on refining the solution 
near the stress singularities. This resulted in a technique that modifies the global 
equilibrium solution in the variational formulation such that an exact asymptotic solution 
is embedded near the point of singularity, continuity of stresses is maintained between the 
asymptotic solution and the variational solution. It is also shown that such modification 
effects only the local regions of the global solution and does not influence the solutions in 
the regions away from the singular points. 

The modification to the variational solution presented here requires detailed 
investigation of several important issues. 

• First is the determination of the radius, p*, where both asymptotic solution and 
the ADM solutions are effective. In the proof of concept investigation 
presented here, this distance is specified arbitrarily based on the prior 
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experience with the angular variation matching. Identification and validation of 
the error measures which can be used to establish an effective radial matching 
distance is crucial. 

• The second investigation required is to generalize the methodology several types 

of singular fields within the ADM model. The boundary conditions 
corresponding to the variational solution differs from crack problems to comer 
problems. Based on the findings and the development presented in this report, a 
general procedure of modifying the global equilibrium system of equations 
within in the variational formulation can be established. 

• As mentioned in the report, for some problems there are several terms in the 

asymptotic expansion which could be singular. Considering the powers of the 
asymptotic expansion higher than the most dominant singular term may be 
required. Considering the additional terms in the asymptotic solution will 
increase the domain in which the solution prevails. This creates additional 
flexibility in accurately finding a matching radius. 

• The variational solution as implemented in the ADM model considered 

orthotropic materials. The asymptotic expansion presented here is valid for 
isotropic methods. A preliminary investigation into the nature of stress fields 
near the interfaces of the orthotropic materials and its effect on the presented 
methodology may be addressed. 
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Figure 15: Concentric Cylinder Model - Global axisymmetric and local asymptotic problems 
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Figure 16: FEM mesh and angular variation matching. 
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Figure 17: Problem A: Bi-material corner problem 

® Alii 



. ^ Figure 19: Problem C- Penny shaped crack 

Figure 18: Problem B - Frictional Crack problem ^ 



Figure 20: Problem-D: Bi-material interface crack, (Complex singularity) 
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Figure 21: Opening stress angular variation comparison for problem - 



Figure 22: Shear stress angular variation comparison for problem - A 


US AFOSR SREP Contract: F49620-93-C-0064; Final Report 
Sub-Contract: 98-0833 


Page 34 of 40 


8-34 









Figure 23: Radial stress angular variation comparison for problem - A 
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Figure 27: Angular variation matching for penny shaped crack problem (Problem-C) 
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Figure 31: Refining the variational solution by embedding asymptotic solution 






Figure 32: Original and modifled solutions for problem - A 
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CONTINUOUS SLIDING MODE CONTROL APPROACH FOR ADDRESSING ACTUATOR DEFLECTION 
AND DEFLECTION RATE SARURATION IN TAILESS AIRCRAFT CONTROL AND RE-CONFIGURABLE 

FLIGHT CONTROL 


Dr. Yuri B. Shtessel 
Associate Professor 

Department of Electrical and Computer Engineering 
University of Alabama in Huntsville 
301 Sparkman Drive 
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Abstract 

A dual time scale aircraft flight control problem is addressed via continuous sliding mode control. Sliding 
surface boundary layer reconfiguration is used to account for actuator dynamics, deflection limits and rate limits. A 
re-configurable sliding mode flight controller is designed that achieves robust, high accuracy tracking of outer loop 
command angles before and after damage to an aircraft. Angular rate commands are robustly tracked in an inner 
loop. The re-configurable flight control strategy is based on a continuous sliding mode controller with direct 
boundary layer adaptation for re-configuration. On-line explicit system or damage identification is not required. 

The continuous sliding mode controller with a finite reaching time is designed for multi-input-multi-output 
nonlinear control systems. Enhanced robustness issues are addressed. Computer simulations confirmed high 
robustness and accuracy of the designed controller. The developed controller design technique is incorporated into 
continuous SMC design in outer loops of traditional and tailess aircraft. 

The re-configurable sliding mode flight control technique is applied to a nonlinear flight dynamics model 
of an F-16 aircraft and to a linearized flight dynamics model of a tailess aircraft, developed under the Innovative 
Control Effectors (ICE) program. Computer simulations demonstrate stability and high accuracy tracking 
performance without violation of actuator limits. 
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CONTINUOUS SLIDING MODE CONTROL APPROACH FOR ADDRESSING ACTUATOR DEFLECTION 
AND DEFLECTION RATE SARURATION IN TAILESS AIRCRAFT CONTROL AND RE-CONFIGURABLE 

FLIGHT CONTROL 


Dr. Yuri B. Shtessel 
Associate Professor 

Department of Electrical and Computer Engineering 
University of Alabama in Huntsville 
301 Sparkman Drive 
Huntsville, Alabama 35899 

Introduction 

1-4 

Re-configurable flight control has been extensively investigated by the United States Air Force due to 
safety and performance benefits of the technology. RecenUy, indirect adaptive control approaches to flight control 
re-configuration have been employed which supply information of system failures or damage to control law gain 

computation algorithms'"^. On-line parameter identification has been done analytically^’'* or on the basis of a rule- 

based expert system^’^. Identified aircraft parameters are supplied to control algorithms which re-configure control 
laws to minimize loss of aircraft performance in the presence of control effector deflection and deflection rate 

limitations^'^. The most severe challenges of indirect adaptive flight control approaches are development of timely 
on-line aircraft parameter identification algorithms and directly accounting for actuator saturation. Failure to 
overcome these challenges may lead to significant tracking performance degradation, pilot induced oscillations and 
possibly loss of stability. Insensitivity and robustness of Sliding Mode Controllers (SMC) to plant disturbances and 

uncertainties makes SMC attractive for flight control reconfiguration. A SMC design consists of two steps. In 
the first step, a sliding surface is chosen such that the closed loop system motion on this surface exhibits desired 
behavior regardless of plant uncertainties, and disturbances. In the second step, a control function is chosen to 
provide reaching of the sliding surface by the system state in finite time and guarantee system motion in this surface 

thereafter. System motion on the sliding surface is called a sliding mode, and strict enforcement of the sliding mode 

13 

typically leads to discontinuous control functions, which may lead to control chattering . However, the chattering 

13 14 

issue is easily addressed by a continuous approximation of discontinuous control functions ’ , or by a specially 
designed robust, continuous SMC'*. Flight control re-configuration without on-line parameter identification was 

previously demonstrated through angular rate tracking in the presence of damage. In this paper, a two-time 
scale aircraft re-configurable flight control problem is addressed via continuous SMC. Robust tracking of angle of 
roll, attack and sideslip angles is achieved through an outer loop SMC by providing angular rate commands to an 
inner loop SMC which provides robust tracking of aircraft angular rates. A sliding surface boundary layer exists for 
continuous SMC, and adaptation of this boundary layer is used to prevent violation of actuator bandwidth, 
deflection, and rate limits. The continuous sliding mode controller with a finite reaching time is designed for multi- 
input-multi-output nonlinear control systems. Enhanced robustness issues are addressed. Computer simulations 
confirmed high robustness and accuracy of the designed controUer. The developed controller design technique is 
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incorporated into continuous SMC design in outer loops of traditional and tailess aircraft. The proposed SMC re- 
configurable control technique is applied to a nonlinear flight dynamics model of an F-16 aircraft and to a linearized 
flight dynamics model of a tailess aircraft, developed under the Innovative Control Effectors (ICE) program. 
Computer simulations demonstrate robust, highly accurate tracking of aggressive commands in the presence of 
damage without violating actuator limits. 

I. Multiple Time Scale Flight Control Using Re-configurable Sliding Modes for Aircraft with Square 
Configuration 


Problem Formulation 

Consider a mathematical flight dynamics model of a rigid aircraft with the square, cascade structure 

y = /ify. ^)+Wy.5, |)+ {Bi(y,^)+ABi(y,^))z 
z = f2(y.z, +Wy.z, + {B2(y,z, + AB2(y,z, ^))5 
^ = h(y.z,4.S) 

S = -As(.S-u), 

where y e 91^ is a vector of command angles (y = {^,a, /if, and (p,a, p, are roll angle, angle of attack, angle of 
sideslip respectively); z€ is a vector of angular rates iz = {p,q,rj , and p,q,r are roll, pitch and yaw rates 
respectively); 5e9i^ is a vector of control deflections (5 = where(5^,5^,5^ are aileron, elevator, 

and rudder deflections respectively); M € is a control vector containing actuator inputs, ^ e is a vector 
containing all other states not contained in y, z, or 5; /i(y,^)69i^, / 2 (>’,z,(^)6 91^, /3(y,z,(5,5)e 91* are 
differentiable vector-functions; Bi(y,i^)e 91^"^, B2(y,z,^)s'3i^'^ are known matrices; A/i e A/j e 91^ 
ABj e 91^, ABi e 91^*^, ABj g are unknown bounded smooth perturbations, caused by damage, failures and 
modeling uncertainties; det(Bi(y,|)+ABi(y,^))^0 and de(B 2 ()'.z,^)+AB 2 (y,z,|))?iO Vy,^ e Q.. 

The deflections and deflection rates of the actuators are assumed to be bounded: 

|5..|<5„,|5.|<5„Vi = i:3. (2) 

Problem: Given a real-time command reference profile y^ (t) , design a continuous SMC to obtain the robust 
asymptotic de-coupled motion of the output tracking errors 

yci = 0 Vi = 1,3 (3) 

in sliding modes described by homogeneous time-invariant equations with given eigenvalues placement for the 
system in eqs. (1) and (2). 

Assumption /. It is assumed that the equations 

i = hiyc’Zcy^^s^) 


( 4 ) 



are stable, where 5^ is the commanded deflection that satisfies equation 5^ --Ag (5^ - u), and u is the control 
that maintains commanded states y^., Zc- 

This is a reasonable assumption since if any elements of | were not stable, it is argued that those elements 
should be moved to either y or z depending on the dynamical structure. This assumption is analogous to the 
mitiiTniiTTi phase assumption used in feedback linearization. 

Flight Control System 

The cascade structure of the system in eq. (1) and the inherent two time scale nature of the proposed flight control 
problem are exploited for design of a two loop flight control system using continuous SMCs in the inner and outer 
loops. The inner loop SMC provides robust de-coupled tracking of angular rates without violating actuator limits. 
The outer loop SMC provides angular rate commands to the inner loop. Together, the inner and outer loop SMCs 
form a two loop flight control system that achieves de-coupled asymptotic tracking of the command angle reference 
profile. Similar structure was developed** for an aircraft control system using a dynamic inversion algorithm. 

Outer Loop Continuous Sliding Mode Controller Design 
Considering the angular rate vector z as a virtual control command z^. only the command angle dynamics 

are considered for the outer loop SMC design 

The vector relative degree of eq. (5) is r =[1,1,1]^, and the sliding surface can be designed as follows'®'”: 
cr = e = 0. However, in order to have desired command angle error dynamics and to nullify a steady state tracking 
error due to continuous implementation of SMC the sliding surface is designed with an integral term in the PI fornoat 
in the first step of the outer loop SMC design 

t 

(T = e-t-C'|edT = 0, a,eeR^, 

0 

where e = f, e,- = y^,- - y.- is the command angle error and C' = diag^\ } Vi = 13 is chosen to achieve 

desired conunand angle error dynamics. 

The second step of the outer loop SMC design is to choose a control function that achieves asymptotic 
stability of the cr dynamics given as 
<t = 'P(.)-[Bi(.) + ABi(.)]z,, 


where 

4'(.) = yc-/iO-A/i(.) + C'e. 

Consider the following Lyapunov function candidate for assistance in the design of a SMC 

V = —cr^0 >0. 

2 

The derivative of the Lyapunov function candidate is 

y = CT^{'P(.)-[Bi(.) + ABi(.)]zJ. 


( 8 ) 

(9) 

( 10 ) 
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A SMC of the following form is chosen 

z, = z,^+^iB^\)SIGNa, (11) 

where SIGN C=col {sign (T;} Vi = 1,3, ^ is a positive scalar and 

(.)❖(.), 'P(-) = yc-fi (•) + C\. (12) 

The equivalent control z^ is the control that provides a zero Lyapunov function derivative similar to a 

feedback linearization term“. With the SMC in eq. (11), the derivative of the Lyapunov function is given by the 
following 

V = a'^{<i>(.)-H(I + EO)SIGNa}, (13) 

whrae 

£(.) = A5i(.)-Brn.). <D(.) = 'P(.)-(/ + £(.))'P(.). (14) 

Assuming the following bounds 

3 _ _ 

|<D, (.)| < Yi . \Eij (.)| ^ 1 = ^’3 V/ = 13 (15) 

;=i 

are satisfied, it is easy to show that 

3 ^ 3 

V<-2|cTi|- g 1-2®.; -r,- • ( 16 ) 

i=l L ;=1 _ 

3 

Requiring y<-2^|o;| will guarantee asymptotic origin convergence, for the system in eq. (7) via outer loop SMC 

i=i 

(eq. (11)), in finite time. Any desired reaching time may be achieved by proper choice of /i,- >0 The 
following choice of g will guarantee a negative definite Lyapunov derivative and finite reaching specified by pi 

g>max • (17) 

.•= 1 , 2,3 

;=i 

Remark 1. The physical meaning of the inequalities in (15) is uncertainty bounds y-dynamical system. Coordinates 
of the vector-function 0(.) = 4'(.) - (7 + £(.))'i'(.) must be bounded-( [C*,- (.)| < 7;) in the perturbed case and zero 

nominally (jO,- (.)| = 0). Elements of the relative uncertainty matrix £(.) = ABj (.) ■ (.) must be bounded in the 

perturbed case and zero nominally (jE(.) = 0). Further, it is obvious that B](.)-i-ABi(.) = (7-i-B(.))Bi(.), so the 

3 

conditions < 1 enforce a diagonally dominant structure to the matrix (7 + £(.)). 

;=i 

Therefore, the outer-loop SMC provides asymptotic stability of the a -dynamics in the presence of 
bounded uncertainties provided that there is sufficient control power. However, the angular rate command provided 
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by the outer loop SMC (discontinuous and thus is incapable of being tracked by the inner loop. A continuous 
approximation*'’*^ of the outer loop SMC in eq.(ll) is used to provide feasible angular rate commands to the inner 
loop. The discontinuous term SIGNain eq.(l 1) is replaced by the following continuous expression 

£-I£l £ir 

C Ui’C2’^:3j ’ 

13 

where are designer-specified constants which value are to be chosen to avoid chattering . 

Inner Loop Continuous Sliding Mode Controller Design 
The angular rate and actuator dynamics are considered for the inner loop SMC design 

(z = f2(y,z,^)+ (i9) 
|5 = -A5(5-4 

The actuator dynamics are included so that actuator bandwidth and rate limits may be directly accounted for in the 
design. A continuous SMC u is designed to obtain the robust asymptotic de-coupled motion of the tracking errors 

lini|z„- - Zi I = 0 Vi = 1,3 

in sliding modes described by homogeneous time-invariant equations with given eigenvalues placement for the 
system in eqs. (19) and (2). Note that the angular rate command is generated by the outer loop SMC. A slidmg 
surface that provides desired tracking dynamics is chosen in the first step of the inner loop SMC design. The vector 
relative degree of eq. (20) is r = [2,2, 2f, and the sliding surface is designed in a PID format*®-*'. This is 

s = i) + C^il + C^jvdr = 0, s,rieR\ 

0 

where v = {rivV 2 .V 3 Y > Vi=Zci-z^ and =dia8^f\ diag^^] = T3 are chosen to provide the 
desired, de-coupled angular rate tracking dynamics in the sliding mode. An integral term is added to nullify a steady 
state tracking error due to continuous in^ilementation of SMC. 

Remark 2. The inner loop sliding mode dynamics in eq. (21) must be much faster than the outer-loop sliding mode 
dynamics to preserve sufficient time scale separation between the loops. 

'The second step of the inner loop SMC design is to choose a control function u that achieves asymptotic 

stability of the s dynamics given by the following 

s = F(.)-B2Qv, B2{.) = B2Q + AB2(.) 


where 


F(.)=z,+C"z,+C' 


v = -Aa (5-i<). 


+Af2)-C^(/2 +4C2)-[4(52+ABj 5, 
dt L“‘ 


AssumpHon 2. The control effectiveness matrix B 2 is assumed symmetric and positive definite. 
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Checking the positive definiteness of the control effectiveness matrix, the signs of its columns may be 
changed with an appropriate sign change of the corresponding control law command elements. Most conventional 
flight control models nominally exhibit diagonal dominance of the control effectiveness matrix. This is 

bii>'Zh\’ *«>0Vi = U. (25) 

;=i 

i*i 

With diagonal dominance, application of the Gershgorin theorem'® provides sufficiency of positive definiteness. In 
unconventional flight control models such as aircraft with damage, the control effectiveness matrix may not be 
diagonal dominant. In this case, the Sylvester theorem'^’'® may be used to verify positive definiteness. 

The following Lyapunov function candidate is considered for design of a stabilizing SMC 

V=i/(B2)r^>0- (26) 

The derivative of the Lyapunov function candidate in eq. (26) is given by the following 

The equivalent control v„ is given by 




The following control law is constructed to provide negative definiteness of the Lyapunov function derivative 
v = v,. + pSIGNs, (29) 


where v„=(B 2 )''^ f(-)--—(^ 2 X^ 2 )is estimate of v , and F(.) is an estimate of F{.), 
^ \_ 2 dt J 

p = diag{pi];Pi >0, SIGNs = diag{signsi], Vi = L3 . Note that is equal to and F(.) is equal to F(.) 

with all the uncertain terms set identically to zero. Assuming the following estimation error bounds for the 
equivalent control elements are satisfied 

|4,|<4 Z,>0 Vi=:i3, (30) 

(B 2 T s. (31) 


^eq ^eq 


it is easy to show that 


V=/(4,-pS/GiVr)<-2r|4,|-A)- 


Requiring ^'ii guarantee asymptotic origin convergence, for the system in eq. (22) via inner loop 

SMC (eq. (29)), in finite time. Any desired reaching time may be achieved by proper choice of p,- > 0 The 
following choice of p,- will guarantee a negative definite Lyapunov derivative and finite reaching specified by p,- 
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P; >|A,jJ + Pi >L, + Pi V/ = l,3. (33) 

Therefore, the inner-loop SMC in eqs. (29) and (33) provides asymptotic stability of the system in eq. (22) in the 
presence of bounded system uncertainties provided that there is sufficient control power. In other words, robust de¬ 
coupled sliding modes exist characterized by a sliding surface in eq. (21) that is reached in finite time"’*l The 
elements of the control law vector are developed by solving eq.(24) for the actuator command, u, and inserting the 
SMC from eq.(29) 


M; = 5; -t-—(v^,. + Pi sign Si ) Vi = 1,3. 


A continuous inner loop SMC with a boundary layer is used to avoid control chattering. The discontinuous tenus 
sign Si are substituted by the following continuous terms 

[l, ifSi>ei 


sar—= i—, i/|ji|^ei 

£; £; 

-1, ifSi<-Ei 


which results in the following continuous inner loop SMC 

u. = 5. + J-f+ Pi satii-1 Vi = U. (36) 

as \ 

Although the continuous SMC in eq. (36) prevents chattering, the control function does not account for 
actuator limits. If there is not enough control power to compensate for disturbances and uncertainties due to damage 
or other modeling errors, actuator deflection and deflection rate limits may be exceeded and controller integrators 
may windup. Exceeding actuator limits and integrator windup may lead to significant tracking degradation up to loss 
of stability and/or pilot induced oscillations. The actuator limitations in eq. (2) are directly addressed by adapting the 
boundary layer thickness. The boundary layer £,. is selected to satisfy the following inequalities that guarantee 
compliance with actuator displacement and rate limits and avoidance of integrator windup: 

a) Vi = 1,3 - integrator anti-windup, (37) 

b) 5; + — Ve . + Pi — ^ Vi = U - actuator deflection limit compliance, (38) 

as 1 ‘ ^i , 


c) V + Pi-^<5^ Vi = 1,3 - actuator rate limit compliance. (39) 

Integrator windup is avoided and actuator rate and deflection limits are not violated for the following 
choice of sliding surface boundary layers 

£; = max{e,- ,£?,£,?} Vi = 1,3 . (^(^) 

where 


e] =|s,|+q‘. 


Vi = 1,3, 
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' agS^ - [agS-, + V,,. )sign Si 

3,_hki- 

Sm -Ve,, sign Si 


V« = l,3, 


Vi = 1,3, 


and r/ > 0, r,-^ > 0, >0 Vi = 1,3 are selected to avoid chattering'^. 

Remark 3. The proposed re-configuration (adaptation) of the boundary layers is implemented on-line, and therefore 
accounts for failures and damage without explicit parameter identification. 

Remark 4. If at least one of the following inequalities is satisfied 

V,,. signsi >a5„-asSi signsi , | > (44) 

only changing the boundary layer thickness according to eqs. (42) and (43) cannot satisfy the conditions (38) and 
(39). A following approach is be used to avoid this anomaly. 

Set Vgg. = 0 in the continuous SMC eq. (36). The following revised expressions may be used in place of 

eqs. (42) and (43) 

2^-Na-^^2^ (45) 

asiS^-SiSigaSi) 

£3^l4b- + r^ , (46) 

Robustness. 

It is well-known that the continuous SMC provides finite-time convergence only to the vicinity of the 
sliding surface, and robustness is considered in the context of the sliding surface boundary layer and referred to as 
e -robustness. Analyzing robustness of the system's tracking performance in the sliding mode we will estimate 
domains of attraction of sliding surfaces in inner and outer control loops. Further, the system's tracking performance 
will be estimated while the system moves in these domains. The domain of attraction of the system in eqs. (22) and. 
(36) is analyzed by considering the derivative of the Lyapimov function eq. (27) 

3 

-24a-a) '/N>a 


-2^i| A 


if U <q 


The boundary layer l^; | = £; will be reached in finite time since V V |j;| >q, and a solution s{l) of eq. (22) 

is uniformly ultimately bounded'^'. Tracking performance e -robustness in the inner loop may be proved firom the 
following expression 


f _ 

fli + cfn + cf J 7J,dt < £;, TJi = Zci - ziii = 13 
0 


I 
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in the sense that there exist'® gt >0 and d; >0 that }^i\< gi\si\ + di < giSt+di within the sliding surface 


boundary layer (ji;| < £;). 

The system's (1) outer loop motion in the cr -subspace in eq (7) will be effected by an extra disturbance 
term [Bj (.) -i- ABj (.)]'? because the command Zc is tracked in the inner loop with the rj -error. This is 
d = 'P(.) -I- [Bi (.) + Ml (.)]rj - [Bi (.) + Ml (.)]z,. (48) 

The domain of attraction of the origin of the system in eq. (48) with the outer loop continuous SMC in eq. 
(11) with a discontinuous term S’/GT/c replaced by the continuous term in eq. (18) is analyzed by considering the 
derivative of the Lyapunov function in eq. (9) 

y = (7^ {'P(.)+[Bi (.) -I- Ml (.)]r? - [Bi (.)+Ml (.)]z, }. (49) 

The conditions in eq. (15) imply the positive definiteness of the matrix (7 + £(.)). Then the following inequality is 
valid'®: cr^ (7-i -£(.))ct >fcrf, where is a minimal eigenvalue of the matrix (7-i-£(.)). Taking into 
account ^(.) ='P(.)-i-[Bi(.)- hM i(.)]q-(7-i-£(.))'?(.) and assuming |o,(.)|<yf(ei) and Ci = C 2 = C3=Co 
continuous outer loop SMC law is substituted into eq. (49). This is 


tr _ u J 


j r _ 

Requiring y<-"^|o;| for eq. (50) the following domain |cr,|< ° \fii+7i{Ei)]= Vi = l,3 will be 

reached in finite time L = where fL are chosen to provide given reaching times = ^ 'j -— • Tracking 

>= 1,3 p-i 7*,' 

performance x -robustness in the outer loop may be proved from the following expression 

f _ 

ei+c]jeidt<Xii£i), C; =yd-y; Vi = l,3, (51) 


— (T — 

; the sense that there exist'® g,->0 and di>0 that |e,| < < g;—+7; (£,)]+within a sliding 


surface domain of attraction (|cri| < Xii^i)) in outer loop. The value Co > 0 must be chosen small enough and 
the value fi>0 must be chosen large enough to achieve given tracking accuracy of mission angle command 
profiles. The proposed algorithm selects the minimum boundary layer possible without violation of the actuator 
limi ts. This will provide graceful tracking performance degradation as modeling error increases or control power 
diminishes such as during damage or aggressive maneuvering. 



Remark 5. Analyzing the system's steady state tracking performance in the inner loop it is easy to show that 
lim Tj; (r) = 0 Vj = 1,3 if -¥ const. Further, the system's steady state tracking error in the outer loop 

f->©© 

is as follows; lim e,- (r) = 0 Vi = 1,3 if ®(.) —> const and E{.) —> const. 

Examples 

Consider the following nonlinear model of an F-16 jet fightra- at Mach = 0.7, h = 10000 ft, 
^trim =0.\Q6^03rad,5^ =-0.0295rad =9,rim =^rrim =(P,rim =5,,^ =0 

0 = qcos(p- rsin cp (52) 

(pzz p + qsiQ(ptaii6 + rcQS(ptaaO 

■ a = -l3p + 0.04ncos6cos(p + 0.08'i5S9+ZaCt + Z^q+Zs5g (53) 

P = -0.9973/- + ap + 0.0427 cos esm(p + YpP + YpP + Ys^S^ + 

p = -0.1345p^ - O.S225qr + LpP + LpP + Lj.r- 50.9335^ + Lg,. 5,. 

■ q = 0.9586pr-0.0S33(r^-p^)-L94l66 + M,,a + Mgq + MsS, (54) 

r = -0.7256p^ + 0.1345^/- + .^^/3+^pP + ^rr + 4.1255a +N^5^ 

5,=20(u2-5a) 5,=20(ui-5j 5,=20(«3-5,), (55) 

Actuator deflection and rate limits are given by the following 
|5,|<0.37rad, |(5,|<lrad/s Vi = a,e,r. (56) 

Desired flying qualities^' are defined by the following filters 


a j^ + 3j + 4 P +6.18J +16.98 qt* s'^ + 2.2s + 2 

A 50% loss of horizontal tail and rudder areas is used to model aircraft battle damage. The parameter values given 
symbolically in eqs. (53) and (54) are represented as nominal terms (.)„ and additive deviations from nominal due 

3 4 

to the damage and uncertainty A(.) ’ ; 

G = G„+AG, (58) 

where 

Gn = [Za,Zg,Zg,Yp,Yp,Ys,.,Ys^, Lp, Lp, L,., Lg,.,M„,Mq,Mg,Np,Np,N^,Ng,.\= 

[-1.15,0.9937,0.0, - 0.297,0.00085,0.0,0.0, - 53.48, - 4.324, - 0.224, 

10.177,3.724, -1.26, -19.5,17.67,0.234 - 0.649, - 6.155], 

AG = [AZ„, AZ,, AZg, AYp, AYp, AYg,, ALp, ALp, AI^, AI^, AM„, AM^, AMg, ANp, 

ANp, AN,, ANg,] = [0.04- l(r-5), 0.0031- l(f-5), -0.177+0.0885- l(r -5), 

0.0534- l(f -1.5), - 0.00005- l(r -5), 0.0372- 0.0186- l(r -1.5), 0.002466 8.024- l(r -1.5), 

0.071-1(1-1.5), 0.055-l(f-1.5), - 5.089-l(r-1.5), 1.856-1(1-5), 0.42-1(1-5), 

9.75-l(l-5),-5.82-l(i-1.5), 0.01-1(1-1.5), 0.133-1(1-1.5), 3.077-1(1-1.5)1, 

1(t) is a unit - step function. 
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Two flight control problems are considered for this aircraft model. First, a simple longitudinal motion 
example is considered in which tracking of a pitch rate command (i) is desired before and after damage to the 
aircraft. The second example is a flight control problem for all axes in which de-coupled tracking of angle of attack, 
roll and sideslip angle commands is desired before and after damage to the aircraft. 

Problem 1. The following simplified model of the aircraft longitudinal axis dynamics is derived^’^ from the model 


given in eqs. (52)-(59): 



+ BS„ 


5 , = -205, + 20mi , 


(60) 


where 


A = A„+hA, B = B„+hB, 


^a^q 


-1.1500 0.9937 
3.7240 -1.2600 


1 

1 

a q 

AM^AM„ 

r 

, AA = 

=: 

-* 

a q 



0.0400 0.0031 
1.8560 0.4200 


•l(r-5). 


(61) 


B, = 


Ms 


0 

-19.5000 


Afr = 


AZ^ 

AMi 


-0.177+ 0.0885 1(1-5) 
9.7500 1(1-5) 


The elevator rate is limited by 15, \<b, (b = Irad / s), and its deflection is limited by |5, | S a, (a- 031 rad). A 


"virtual" control is defined as 

vi =-205,+20ui . (62) 

and it is obvious that Vj must obey the following inequality: |vi| ^ 5. 

A pitch rate command q^{t) is generated by a reference model of the form in eq. (60) that addresses 

3,4 

desired flying qualities^' where matrices A, ,5, are the reference model state-space matrices : 


■-1.2693 0.9531 

. Br = 

■-0.1770 ■ 

- 9.4176-5.7307_ 

’ r 

-19.5000 


. The pilot’s commands 


are 0.1 rad/s pitch rate pulses of 1.0 sec 


duration with polarities of -, +, -, + at times of 0.0, 3.0, 6.0, and 9.0 sec. respectively. A 50% loss of rudder area 
occurs at 5.0 sec. 

A continuous SMC is designed with boundary layer re-configuration in the format of eq. (21) and eq. (36) 


without integral term to provide given settling time of a transient response in a sliding mode r, <1.0s. The 


coefficient C 2 is chosen C 2 = 5 > - 


and the SMC is designed as follows: 


Ml =5,+0.05piJar—, Ji =f?i+5.07}i, (63) 

^1 

It is assumed that Vj,^ = 0. A value of the gain Pj must be chosen to satisfy inequality (33) with 

P\>\>\eq\+Pl > 

The equivalent control is assumed to have the following structure 
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( 65 ) 


where corresponds to the equivalent control in the nominal case, and is due to model uncertainty and 

aircraft damage. The value of Pi in eq. (64) must be chosen to provide given reaching time ^l.Os . This is'®"'^ 


Pj > JfM. The value of Pi = 0.5 is chosen for all |s(0)| < 0.5. The value of Pi is chosen assuming 


that 


Vl.J< 


^^nom 


. This is 


+ 0.5, 


Vj^ =-0.05128(^^+5pJ +0.495a-0.0519^-3.735^ . 


( 66 ) 

(67) 


Substituting = 0.37,= l.Oandr* =r^ =r^= 0.02 into eqs. (45) and (46) gives the following expressions 


er = 


_a05|sj^ 


-+ 0 . 02 , 


ef =1^1 |pi+0.02. 


( 68 ) 

0.37 - design Si 

So. the re-configurable continuous SMC is completely defined by eqs. (40), (41), (63), (66)-(68). The simulation 
results are shown in figures 1-3. The q^ii) profile is tracked almost perfectly before and after a 50% loss of 


horizontal tail area (fig. 1). The actuator deflection and deflection rate are within the limits (fig.2). The tracking 

3 4 

performance is much better when compared to the corresponding results from past work ’ , which used explicit 
parameter identification for reconfiguration after damage. Further, the re-configurable continuous SMC is less 
complex since on-line identification is not required. The simulation was repeated with measurement noise. To 

compensate for the noise, a low-pass filter with a transfer function W{s) = q filtered the sliding surface, and 


the gain pi was fixed at pj = 3.0 • max|vi^^ | + 0.5. The tracking performance was similar to the case presented in 
fig. 1, and actuator deflection and deflection rate were within limits (fig. 4). 

Problem 2. The multiple time scale concept is employed to design a two-loop control system with continuous SMC 
in both loops such that the attack, roll and sideslip angles follow commands before and after a 50% loss of 
horizontal tail and rudder. 

A continuous SMC is first designed in the outer-loop. Nonlinear eqs. (52) and (53) of the F-16 
mathematical flight model are considered. The angular rate command profiles and are treated as virtual 

controls for the command angles (p,a and /?. The outer-loop SMC design starts with the choice of a sliding 
surface in eq. (6), given a settling time of a transient response in sliding mode t, < 1.55'. The coefficients C? are 


chosen as follows: 



and the sliding surface is designed 


(T; =e,.+3.oJeidT, Vi = 1,3, (69) 

O 


4 
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where e^=(Pc-(p, e 2 =cc^-a, P. A SMC is chosen in the form of eq. (11) with 

Zc^=\peq,Qeq>KqY calculEtecl 38 follows: 

Peq =9c + 3-0(p^ -qsmftanO-rcos(pta.n6-3.0(p 
Qeq =ac+3ac-1.85a+)57-O.O427cos0cos^)-O.O83589 

-3-OPc + 2.7029)3 + cg)+ 0.0427cos0sin <p + 0.00085 \p. 

The following matrices are constructed 


and 


-3.1)3 - 0.9973-(a+ 0.00085) cos (jJ tan© -)3cos(jDtan0 



1 

sin (p tan © 

cos (jj tan © 


0 

0 

o' 

Bi(.) = 

-P 

0.9937 

0 

. ABi(.) = 

0 

0.0031 

0 


a + 0.00085 

0 

-0.9973 


-0.00005 

0 

0 


_ 1 1 

[0 

0 



0 



£ = ABi • V = 


10 “ 




4.97 10 ^ -4.99 10 ^sin^tan© 


4.97 10 ^cosflJtan© 


where |Bi | = -0.991 - 0.9937(a + 0.00085) cos ()) tan © - 0.9973sin ()) tan © . The elements of the E matrix satisfy the 
conditions of eq. (15) in a significant flight envelope. For relatively small angles P,(p,a and © , the matrix Bj is 
diagonal-dominant. This allows outer-loop SMC (eqs. (11) and (18)) design in a simplified de-coupled format. 
Finally, the outer-loop SMC is designed in the following continuous format: 

Pc = P«?+15.0<Ti, 9*5+50.0(72, r, = f^-6.0C3. (70) 

Given the reference profiles Pc,qc^h “ eq- (70) a re-configurable continuous SMC is now designed for 
the inner loop. Recall that Bj must be positive definite for damage and undamaged cases. The matrix 


-B, = 


50.933 0 -10.177 

0 19.5 0 

-4.125 0 6.155 


is diagonal-dominant and positive definite in accordance with Gershgorin 
theorem'^. Note that a control law command sign change justifies analyzing the positive definiteness of -B 2 instead 


of Bq {BqU = i-BqX-u)). Tho damaged matrix -B 2 = 


50.933 0 -5.089 

0 9.75 0 

-4.125 0 3.077 

, 13,19 


is not diagonal dominant, 


however it is still positive definite in accordance with Sylvester criterion 

The sliding surfaces are designed in eq. (21) format by considering the F-16 mathematical flight model in 
eqs. (54) and (55) and given settling time of a transient response in the inner loop sliding mode t, < 0.75 s that is 
twice as fast as in the outer loop. This is 


Si=^i+12T)i+36jjJidT, S 2 =T)2+12r72 +36j772^r, s, = 1 ) 3 + 161 ^ 3 +64 


( 71 ) 


where r\i = Pc-P^ t ?2 = »?3 = '"c “ • 
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The control functions are designed in the format of eq. (36) assuming = 0 


Uj = 5i +0.05pjsat—, Pi>|ve„,| Vj = l,3 . (72) 

Addressing the actuator deflection and deflection rate limits in eq. (56), the boundary layer is reconfigured 
according to eqs. (40), (41, (45) and (46) as follows: 

fij = max'll, e?,ef} Vi = 1,3, (73) 


e] =|j,| + 0.02. 


ef = 


0-05Ji|Pi 


(0.37 - 5 {Signs i ) 


+ 0.02, e? =|5 ,.|p; + 0.02, Vi = 1,3 


(74) 


The following simulation commands are considered for evaluation of the two-loop re-configurable SMC: 

• The angle of attack command (with respect to a trim value = 0.106803 rad) is two 3.0 s duration 
pulses of 0.1 rad amplitude and polarities + and - during time intervals [0,3] and [6, 9] seconds 
respectively. 

• The roll angle command is one pulse of 1.0 rad amplitude during the time interval of [1, 9] seconds. 

• The sideslip angle command is zero for all time. 

The flying quality filters in eq. (57) are used to filter the commands. Loss of 50% of rudder area occurs at 1.5 sec, 
and loss of 50% of tail area occurs at 5.0 sec. The results of the simulation given in figures 5-9 show that the 
prescribed maneuver is accomplished with excellent tracking performance before and after damage to the aircraft 
without violating the actuator limits. Figure 10 shows that the boundary layer thickness varies significantly during 
aggressive commands and once aircraft damage occurs yet remains bounded. 

Conciusions 

A re-configurable two-loop continuous SMC method is developed and successfully applied to a realistic aircraft 


flight control problem. Actuator limit compliance is achieved by sliding surface boundary layer direct adaptation 


and demonstrated through high fidelity simulations of an aircraft subject to major damage and measurement noise. 


n. Finite-Reaching-Time Continuous Sliding Mode Controller for MIMO Nonlinear Systems 

Introduction 

Sliding mode control is a nonlinear robust control technique, which is actively developing during last 30-35 
years'®’". A sliding mode controller design comprises two steps'®'". At the first step a "custom-made" surface 
(manifold) is to be designed in a state variable space such that the closed-loop system exhibits a given (desired) 
performance while moving (sliding) on this surface, called sliding surface. While on the sliding surface the plant's 
dynamics is restricted to the equations of the surface and are robust to matched plant uncertainties and external 
disturbances. At the second step a feedback control law is to be designed to provide convra-gence of a system's 
trajectory to the sliding surface. In particularly the designed control law must provide an asymptotic stability of the 
origin in a sliding surface subspace. Since the closed-loop system exhibits he desired robust performance moving on 
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the sliding surface, it is highly desirable to reach the sliding surface in a finite time. The system's motion on the 
sliding surface is called the sliding mode. Historically, sliding modes were discovered in variable structure systems, 
which feedback structure changes depending on value of a state vector to drive the system's trajectory onto the 
"custom-made" sliding surface. While on the surface the variable structure control yields an infinity frequency 
switching control*”’”, which is practically unrealizable. Dynamics of switching elements and finite sampling time of 
microprocessors prevent from implementation of infinity frequency switching control law and lead to so called 
control chattering (limit cycles with finite frequency oscillation in a closed-loop system). In order to avoid chattering 
a continuous implementation of a discontinuous control law is often considered*^'''*’^. Usually this is an equivalent 
control plus a linear control law with respect to sliding surface. Very often a linear control law is taken with 
saturation*^’*'*. The continuous implementations of a sliding mode controller (SMC) are widely known for 
instance*®’^'^^ Such SMCs are free from chattering, but have some disadvantages comparing with infinity frequency 
switching sliding mode controllers. They are: 

1. In undisturbed case a sliding surface can be reached only asymptotically, i.e. reaching time —> . 

2. In case of nonvanishing disturbances and uncertainties convergence is provided only to some vicinity of the 
sliding surface. It means loss of robustness comparing with infinity frequency switching sliding mode 
controller. 

In this work we consider a continuous sliding mode controller as a specific type of a SMC ’ , but not as 
implementation of a discontinuous high frequency switching control law. Continuous SMCs, which drive 
trajectories of a single input-single-output (SISO) and MIMO nonlinear control system to sliding surfaces in a finite 
time are designed in the works*®’^^ In this report we follow the work^^ in which a fmite-reaching-time continuous 
sliding mode controller is designed for multi-input-multi-ouqiut (MIMO) nonlinear control systems using finite-time 
differential equations analysis^”’”*. Robustness of the designed controller, which is an issue of crucial importance, is 
addressed as well. Theoretical results are validated via computer simulations. 

Problem Formulation 

Consider the nonlinear square MIMO system*^ 

X = fix)+G(x)u, y = h(x) (75) 

where xe R", ye R'",ub R'^ . Assume that the functions f(x), h(x) are smooth vector fields, and columns 
gj{x) Vi = Urn of the matrix G(x)g R"^ are smooth vector fields as well. Assume also that the system (75) is 
conqiletely linearizable in a reasonable domain x g F, i.e. the system (75) does not have internal (zero) dynamics. 
Given in real time an output reference profile y/t) identify a continuous sliding mode controller which 

1. provides a given eigenvalues to the de-coupled output tracking error performance in the sliding mode, 

2. drives the closed-loop system trajectory to the sliding surface in a finite time. 

System Transformation 

Differentiating the outputs, the system (75) can be easily transformed to a normal form*” 



■yCr, ■ 

_ 

s s 

1 _ 

+ E(x)u, 

E(x)^ 

L,, (I,)-'/,, 

I- 





L,, {Ly-%) )...)_ 


where |£(j:)| 0 Vjc€ r; and Vj = 1,to are corresponding Lie derivatives. 


Terminal Attractor 

Following the works^®'^\ a useful result is formulated in a following Lenuna. 

Lemma. The origin of a differential equation 

i = -0)z'‘, z(0)>0,Q)>0 (77) 

is a terminal attractor with a finite reaching time 


" tB(l-a) 


(78) 


Indeed, integrating (3) we have obtained 

{zitif-'' = (z( 0 )T‘‘ -{\-a)(Ot. (79) 

Requiring zit^ )=0 implies formula (78). 

Finite-Reaching-Time SMC Design for Nonlinear MEMO Output Tracking 
Achieving de-coupled output tracking error performance in the sliding mode, the following sliding surfaces 
are designed 

cr,- +... + Ci 4 cf' +<^i.o«i. Vi = 1,ot, (80) 


C; - - 

where e,- = ,(/)- }’,•(?), ey’ = —^ and the coefficients c,- • Vi = l,mV; = 0, q - 2 are chosen to provide given 

’ dt^ 

eigenvalues placement to the de-coupled differential equations 0; = 0 Vi = 1, /n. 

Designing a continuous sliding mode controller for the system (76) we need to provide asymptotic stability 
to its motion in the cr -subspace, which is described as follows; 

ct = ^(.)-F(jc)m (81) 

where 


^(.) = = +... + c,.ocf^ -^^.(x) Vi = l,m. 


Introducing a new control variable 
u = E{x)u , 

the system (81) can be rewritten in a scalar format. This is 
O’; =¥';(•)-M; Vi = l,m 


(82) 

(83) 

(84) 


The system (84) is obviously input-output de-coupled. Hence, analyzing stability of the origin of the system 
(84), we can analyze stability of each i*’* equation Vi = 1, m . Candidates to Lyapunov functions are introduced 


9-18 







V,=i<7f <*5) 

2 

We will look for the derivative of the candidates to Lyapunov functions (85) in the following format: 

V; = V,"' , fl; > 0, >0 . (86) 

Then, applying Lemma we can conclude that the origin of the i”" equation of the system (84) will be reached in a 

finite time Vi = l,m 

(mitl VaMoi) 

' (Diil-ai) 

It means that <7,- =0 Vi = Lm become terminal attractors. Substituting expressions (80) and (84) into formulas 
(85), a control function u is identified as follows: 

u =-'¥{.) +R-l., ( 88 ) 

where /? = £iiag|-^| Vi = l,m; X = . 

Substituting equation (88) into equations (84), we obtained equations of the closed-loop system’s (76) 
motion in the <J -subspace. They are 

df = Vi = 1, m . (89) 

Making coordinates of the control function (88) continuous and avoiding singularity in the origin of the system (89), 
we will look for the parameters "a, " in the following format: 

ai=— Vi = l,m, (90) 

Qi 

where Pi and Qi are integer numbers. Since a,- can be either positive or negative the values of and 
2p. -q^ >0 must be odd. Hence, the parameters p,- and q^ must meet the following conditions 

P;,^; e n,: ■{pi,q'i|2p,>q’i, Pi<qi, ?,■ is odd} Vi = l,m. (91) 

As soon as the values of the parameters p,- and qi have been identified in accordance with the conditions (17) the 
value of the parameters ta,- must be chosen to achieve a given reaching times (87). 

The original finite-reaching-time continuous sliding mode control law u has been identified in accordance 
with formulas (83) and (88) as follows: 

« = £(x)-1-'P(.)-t-i?x] (92) 

The obtained results are formulated into the following Theorem. 

Theorem. The sliding surfaces (84) will be reached in finite times (87) via the continuous sliding mode controller 
(92) if values of parameters "a," are chosen in accordance with formulas (90) and (91). 

Robustness 


Assuming 
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'¥{.) = 'Po (.) + A'PC.), Eix) = Eo (x) + AE(x) (93) 

where ATC.) and AE{x) are uncertain vector and matrix correspondingly. Designing control (92) in the following 
format 

u = Eoix)-^[-Wo{.) + R-l], (94) 

and substituting expression (94) into equations (84), we obtained 

C7 = A4'(.)-AA/-4'o(.)-(/ + AAf)/?-Z, (95) 

where AM = AE(x) ■ Eq^ (x) . 

Equation (95) is rewritten in a scalar format. TTiis is 

^ CO 

(Ti =AYit)-{AM , V, = i:^ (96) 

2 ' j=i 2 ■' 

Substituting equations (96) into equations (85), we obtained 


2''‘ - Avr,(.) + (AA/ •'Po(-)).- 


- a)..(l + AA/i,.) + - 




Assuming 


|AM,.|<L,<1, |aM.^|<L^, \AYi\<Ri, |( AM-%(.)),.I <G,., \7j\<nj Vi = l,m V; = l,m 
we introduce positive constants 

m /-.i 

2 ^ 7 2a —1 

;=1 2 


Transforming equations (97) taking into account expressions (98) and (99) we obtained 


y,. < - m.. (1 - L,) - v,"- Vi = 1, m. 

p, ‘ 

Inequalities (100) are rewritten in a format 

y,-<-(55,.y;“' Vi = hm, 

where 

(y,(0)y-“‘ — 

CO; = - - -, Vi = 1, m 
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ap 

Inequalities (103) actually form domains where conditions (101) are met. It means that the following domains 




Vi =l,m 


will be reached by the trajectories (T,- in given finite times (87), and trajectories cr,- will stay in these domains 
thereafter. Hence, solutions of the closed loop systems (96) are uniformly ultimately bounded'"*. What is remarkable, 
that choosing values of the parameters to,- to meet inequalities 

2‘''0, +Si — QOS) 

cj. >->- !-Vi=l,m 

■ 1-L, 

and making 2a; -1 arbitrary small, we can make a size of the domains of attraction in eq. (104) arbitrary small as 
well, retaining the control law (94) continuous. In order to achieve a similar result, using linear/saturation 
approximation of a discontinuous sliding mode controller, a gain of such controller must be arbitrary large'^’'"*, 
which is impractical. Hence, a designed continuous sliding mode controller is extremely robust. 

Example 

A nonlinear MIMO system given in the format (75) is considered 

'ii = X2 

X2 = cos X3 + {xi + l)<i + «2 + /i (0 (106) 

X 2 = sin Xj + O.Suj + (^2 + 1)<2 + (f) 

yi =^ 1 . J’a =^3 

Functions /,• (f): |/f (t)\ < Vi Vi = 1,2 are unknown, bounded, matched disturbances. The system (106) is completely 
linearizable with a vector-relative degree r = {2, l}. The system (106) is transformed'^’'^ into the normal format (76) 


as follows: 


yi cos 




yjl Isinxi 


0.5 (x|+l)jL“2j’ 


The goal is to design a robust, continuous sliding mode controller to provide de-coupied tracking of smooth output 
reference profiles yir(t) and y 2 r (0 in finite-reaching-time sliding modes. 

The sliding surfaces are designed in the format (80) as follows 

(Ti=ei + 5ci, <72 =£ 2 , 

where Cj = yj^ - yi, ^2 = y 2 r “ >’2 • 

The parameters p,-, and tO; are identified Vi = 1,2. They are 

Pi-P2-^’ 9i= 92=3, oil =16, (B2=10. 

The continuous "cubic root" sliding mode controller is designed in the format (92) as follows: 


•I 
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\ui = yir +5)'l. -COSX 3 - 5 X 2 +10.0^ 

[«2 = hr - sin + 6.3^/^ 

The system (106) was simulated with the continuous "cubic root" sliding mode controller (110) and with the 
continuous "linear" sliding mode controller (111) 


Ml = yir + 5yir - cos X 3 - 5 X 2 + IbCTj, 
“2 - >' 2 r “ sin Xi + 20(72. 


(Ill) 


It is worth to note that gains before the <t - terms in formulas (111) are increased from 10 to 16 and from 6.3 to 20 
conparing with formulas (110), which helps the “linear” controller (111) to be more aggressive. 

The following initial conditions and the reference profiles were used 
Xi(0)= 0.2, X2(0)= 0.1, X3(0) =-0.15, yi/r) = 0.5sinf, y 2 r(t) = Oncost. (112) 

The results of the simulation are shown in fig. 11-17. Simulations confirm a finite-reaching time property 
of the designed continuous sliding mode controller and show improving robustness to disturbances comparing with 
“linear” sliding mode controller. 

Conclusion 

The continuous sliding mode controller, which reaches sliding surfaces in a finite time, is developed for 
nonlinear MIMO output tracking. Robustness of the designed controller is analytically analyzed and validated via 
computer simulations. 


TTT. Flight Control Using Re-configurable Sliding Modes for Tailess Aircraft 


Introduction 

Tailess aircraft has a reduced vertical tail or no vertical tail at all. Radar cross section and weight reduction 
benefits have influenced this tailess aircraft configuration. On the other hand, tailess configuration present a 
challenge from a stability and control perspective”'”. Absence of a vertical tail reduces directional stability and 
directional control power. Unconventional control effectors are usually added to overcome control power 
deficiencies”. The unconventional control effectors must be allocated with other conventional effectors to achieve 
acceptable maneuvering performance of a tailess fighter via a corresponding control law design. 

A modular (multiple-loop) flight control structure for flight control of a tailess aircraft is developed^^’^. It 
includes a feedback control, parameter estimation and control allocation modules. TTie control allocation module 
allocates the control effectors to achieve maximum control power and limit control law commands to prevent 
actuator saturation”. The developed^^ control allocation algorithm operates upon assumption that full information 
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about an aircraft mathematical model is available. A parameter estimation module supports this assumption. 
However, in case of a battle damage to an aircraft, parameter estimation becomes a "bottle-neck" problem, because 
errors in damage estimation can lead to a significant degradation of an aircraft performance. 

In this work we develop a re-configurable flight control design technique on the basis of Sliding Mode 
Control for tailess aircraft with damages and actuator limits. On-line damage (parameter) estimation is not required. 
An optimal control allocation algorithm is employed using nominal mathematical model of an aircraft. Continuous 
SMC with direct adaptation of a boundary layer automatically adjusts to an aircraft damage and prevents actuator 
from deflection and rate saturation. 

Problem Formulation 

Consider a mathematical flight dynamics model of a rigid tailess aircraft with the following cascade 


structure 

y=f\(y^ ^i(y> s, |)+ iBM+ ABi(y, |j)z 

z = f2(y.z,^)+^f2(y>z^^)+iB2(y.z,^)+AB2(y,z,^))5 
i = f3(y.z,^>S) 

S = -As(3-u), 


where y e 91^ is a vector of command angles (y = {(p,a, pf, and q),a,p, are roll angle, angle of attack, angle of 
sideslip respectively); z6 is a vector of angular rates iz = {p,q,rf , and p,q,r are roll, pitch and yaw rates 
respectively); 5e3i'" is a vector of control surface deflections (mS3), me 91'" is a control vector containing 
actuator inputs, ^£91* is a vector containing all other states not contained in y, z, or 5; /i(y,|)e9i^, 
/ 2 (y,z,^)E 9i^, /3(y,z,|,5)€ 91* are differentiable vector-fiinctions; 5i(y,|)e 91^'^, (y,z,|)e are 

known matrices; A/l6 9l^ A/2€9i^ are unknown bounded smooth perturbations, 

caused by damage, failures and modeling uncertainties; det(Bi(y,^)-t-ABi(y,^))7iO Vy,(^ e . 

The deflections and deflection rates of the actuators are assumed to be bounded: 


|5i|<(5„,5i <5„Vi = l,m. 


(114) 


Problem: Given a real-time command reference profile design a continuous SMC to obtain the robust 

asymptotic de-coupled motion of the output tracking errors 

limlyci-yil = 0 Vi = 1,3 (115) 

in sliding modes described by homogeneous time-invariant equations with given eigenvalues placement for the 
system in eqs. (113) and (114). 

Assumption. It is assumed that the equations 




(116) 


are stable, where 5^ is the commanded deflection that satisfies equation 5^ =-As(5c -m) , and u is the control 


that maintains commanded states y,., z^ ■ 


4 
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This is a reasonable assumption since if any elements of ^ were not stable, it is argued that those elements 
should be moved to either y or z depending on the dynamical structure. This assumption is analogous to the 
minimum phase assumption used in feedback linearization. 

The formulated control problem is similar to the problem that was formulated and addresses in the first 
chapter of this report. The main difference is in non-square configuration of a tailess aircraft model in eq. (113). 
Outer Loop Finite-Reaching Time Continuous SMC Design 
Considering the angular rate vector z as a virtual control command z^, only the command angle dynamics 
are considered for the outer loop SMC design 

y = fi(y.O+^fi(y.3,^)+(Bi(y.^)+ABi(y,0)z . (117) 

The system eq. (117) is already in a eq (76) format with the vector relative degree of eq. (117) is r =[1,1,1]^, and 
the sliding surface can be designed as follows'®"*^: 

t 

<T = e + C^jedT = 0, CJ,ee , (118) 

0 

whCTe e = {ci,e 2.«3 F . = I’d ~ yt 1* command angle error and C' = diag^] } Vi = 1,3 is chosen to achieve 

desired command angle error dynamics. In order to have desired command angle error dynamics and to nullify a 
steady state tracking error due to continuous SMC the sliding surface is designed with an integral term in the PI 
format eq. (118). 

The second step of the outer loop SMC design is to choose a control function that achieves asymptotic 
stability with finite reaching time of the a dynamics given as 

CT = % (.) -l- A'P(.)- [Bi (.) + ABi (.)]z,, (119) 

where 

'PoO = 3'.-/i(.) + C'e, A'P(.) = -A/-,(.). (120) 

Applying the finite-reaching time continuous SMC design technique that is developed in the Chapter n and using 
eq. (94), we obtained expression for the virtual control z^ as follows: 

( 121 ) 


where 


Zc = B'\ 


‘(.)'Po(.), R = diag 


^ ^ f 2 

2“* ’ 2“^ ’ 2“^ J ’ ^ 


2a.-l 203 - 1 , 2a3-l 




( 122 ) 


Denoting AM(.) = A5 i(.) bF(-). g(.) = AM (.)•%(.) and assuming 
|AM;,|<L,; <1, |aM,;,|<L,;;, |A'Pi|<Ai, |^(.)i|<G,-, j\<nj Vi = l,3 y7=l,3. 


(123) 


we u nif orm ultimate boundadness of a -dynamics. In other words, the control law in eq. (121) guarantees 
convergence of cr — trajectories to a domain 
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1 

\2a,-l 


(124) 


2‘‘‘Oi 


V/ = l,3, 


where 


Vi.l3. (125) 

"2 ' 

Parameters co,- and a; of the continuous finite-reaching-time SMC eq. (121) must be chosen to provide given 

reaching time in eq (102) and given size of the domain of convergence of (T - trajectories in eq. (124). 

Inner Loop Finite-Reaching-Time Continuous SMC Design and Control Allocation. 

The angular rate z - dynamics are considered for the inner loop SMC design 
z = f2(y.z,^)+^2(y.z,^MBM+^2(y.^))^ ■ (^ 26 ) 

Vector-deflection 5 is treated as a control function in eq. (126). 

A continuous SMC 5^ should be designed to obtain the robust asymptotic de-coupled motion of the tracking errors 
limlzei - z,| = 0 Vi = 1,3 (127) 

in sliding modes described by homogeneous time-invariant equations with given eigenvalues placement for the 
system in eqs. (126) and (2). Note that the angular rate command Zc is generated by the outer loop SMC. 

The vector deflection-command 5^ will be designed into two steps. 

At the first step, denoting 

B2(y,^)5c = V,,, v,, e (128) 

and rewriting eq. (126) in a following square structure; 

i = /2()'.z.^)+4f2()'.z.|l+Vc +^2(>’-0^c - (^29) 

we will design in eq. (129) as a new control function to address a problem in eq. (127). 

At the second step , assuming the control law is designed in a finite-reaching-time continuous SMC format, a 
control allocation must be performed to transform the designed control law into deflection vector-command 3^: 
S,=BA>c> 

where e is an allocation matrix that must be identified. 

Substituting eq. (130) into eq. (129) we obtained 

z = f 2 (y>z, 0 +^f 2 (y>z,^)+U +£(-))Vc. (131) 

where E{.) = AB 2 (.)B^{.). Obviously, structure of eq. (131) is absolutely the same as in eq. (117). Therefore, the 
sliding surface is designed similar to eq. (118) 


t 

= e + ^ jed't = 0, (j,e€R^, 



where e ={ei,e 2 ,es}^, e,- =Zci -Zj is the command angle error and =diag^j] Vi = l,3 is chosen to achieve 
desired angular rate error dynamics. 

Remark. The inner loop sliding mode dynamics in eq. (132) must be much faster than the outer-loop sliding mode 
dynamics in eq. (118) to preserve sufficient time scale separation between the loops. 

Designing a control function that achieves asymptotic stability with a finite reaching time of the a 
dynamics given as 

^ = ¥o(.)+A^(.)-[7 + £(.)K. (133) 

whCTe 

%(-) = 2c-/2(-) + C'e, An) = -A^2C), (134) 

we obtained a following expression for the control : 


v^=v^ +Ri:, 

c 


(135) 


where 


^ D J- I 1 . V -253-1 Y 

'l'o(-). E = .cTj ^ / . 


[2“* 2“ 

Parameters m,- and dj in eq. (136) can be identified by analogy with parameters ©,■ and a,- ineq. (122). 

An allocation matrix B^, (.) is identified as a solution to the following quadratic programming problem: 


(136) 


J = 5^Q5c 




■»min 


(137) 


as follows: 

BA.) = Q-^Bl(.)[B2(.)-Q-^Bli.)Y, (138) 

where 2 is a positive definite matrix. A control allocation algorithm developed in the work^^ can be applied as well. 
A "Very" Inner Loop Re-configurable Continuous SMC Design Addressing Actuator Limits 
At this point we have a vector-deflection command profile 5^ that is generated by an inner loop finite- 

reaching-time continuous SMC in eqs. (135) and (136). Being executed this profile will achieve a high tracking 
accuracy of mission angles in presence of additive and multiplicative uncCTtainties (failures) without their explicit 
identification. Now we need to execute the vector-deflection command profile 5^ by actuators, taking into account 
actuator deflection and deflection limits. A mathematical model of actuators is given as follows: 

5 = -AsiS-u), = (139) 

A problem is to design a control law u achieving 

lim|5„-5i| = 0 Vi = l,m (140) 


and meeting limitations in eq. (139). 

Obviously, a plant in eq. (139) is of a square structure. Therefore, and a design technique of a re- 
configurable continuous SMC with direct boundary layer adaptation, which is developed in the Chapter I of this 
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report, is used to address the problem in eqs. (139) and (14). A sliding surface that provides desired tracking 
dynamics is chosen in the first step of the "very" inner loop SMC design. The vector relative degree of eq. (139) is 

r = [1,1, ...1]’", r € 91 ^, and the sliding surface is designed in a PI format'®'’^. This is 


f 

s = rj + C^jridt = 0, s,r}&R”', 


whra-e = and Vj=l,m are chosen to provide the desired, de-coupled 

deflection tracking dynamics in the sliding mode. An integral term is added to nullify a steady state tracking error 
due to continuous in^jlementation of SMC. 

Remark. The "very" inner loop sliding mode dynamics in eq. (141) must be much faster than the inner loop sliding 
mode dynamics in eq. (132) to preserve sufficient time scale separation between the loops. 

The second step of the inner loop SMC design is to choose a control function u that achieves asymptotic 

stability of the s dynamics given by the following 

i = (142) 

whore 

v = As{u-S). (143^ 

The following Lyapunov function candidate is considered for design of a stabilizing SMC 

V=-/j>0. (144) 

2 

The derivative of the Lyapunov function candidate in eq. (144) is given by the following 

V = /j = /K+C^t7-vl. (145) 

The equivalent control is given by 

The following additional discontinuous control term is constructed to provide negative definiteness of the Lyapunov 
function derivative in eq. (145) 

v=^v^+pSIGNs, (147) 

where p = diag{pi}, p; > 0, SIGNs = diag{sign Sf }, Vi = 1, m. It is easy to show that 

V=-f{pSIGNs)=^^i |i,|, (14^) 

M 

and the sliding surface in eq. (141) will be reached in a finite time. 

The elements of the control law vector are developed by solving eq. (143) for the actuator command, «, and 

inserting the SMC from eq.(147) 


«i = 5; +—(v,y, + Pi sign Si) Vi = 1, m. 


4 
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A continuous "very" inner loop SMC with a boundary layer is used to avoid control chattering. The discontinuous 
terms signsj are substituted by the following continuous terms 

1, if Si > Ei 

= (150) 

- 1 , ifSi<-Ei 

which results in the following continuous "very" inner loop SMC 


M. = 5; + — v„, + Pi sat — Vi = l,m. 


Although the continuous SMC in eq. (151) prevents chattering, the control function does not account for 
actuator limits. If there is not enough control power to compensate for disturbances and uncertainties due to damage 
or other modeling errors, actuator deflection and deflection rate linoits may be exceeded and controller integrators 
may windup. Exceeding actuator limits and integrator windup may lead to significant tracking degradation up to loss 
of stability and/or pilot induced oscillations. The actuator limitations in eq. (139) are directly addressed by adapting 
the boundary layer thickness. The boundary layer thickness £,. is chosen to satisfy the following inequalities that 
guarantee compliance with actuator displacement and rate limits and avoidance of integrator windup: 

a) 15,12 Vi = 1, m - integrator anti-windup, (152) 

. / N _ 

b) 5.+— V + p- — <5„ Vi = l,m - actuator deflection limit compliance, (153) 

d) V + pj—< Vi = 1, m - actuator rate limit compliance. (154) 

*'■ Ei 

Integrator windup is avoided and actuator rate and deflection limits are not violated for the following 
choice of sliding surface boundary layers 


e, =iiiax 


where 






Vt = 1, m . 


Vi = 1, m , 


'' as5„-[as5i 


Vi = 1, m , 


= _ - +r?, Vi = 1,3, (158) 

signs i 

and rj* > 0, > 0, > 0 Vi = 1,3 are selected to avoid chattering^^ 

Remark. The proposed re-configuration (adaptation) of the boundary layers is implemented on-line, and therefore 
accounts for failures and damage without explicit parameter identification. 

Remark. If at least one of the following inequalities is satisfied 
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(159) 


signs i >asS„-as5i signs i , | | > 5„ 

only changing the boundary layer thickness according to eqs. (157) and (158) cannot satisfy the eqs. (153) and 
(154). The following approach may is used to avoid this anomaly. 

Use equivalent control estimate instead of v^. in the continuous SMC eq. (151) and set it to zero: 


Vgj, = 0. The following revised e3q)ressions may be used in place of eqs. (157) and (158) 


ef = 


Si|Pi 


-diSign Si) 


■ + rr 


(160) 



(161) 


Example: Tailess Fighter Re-configurable Control 

The tailess aircraft is a 65-degree sweep delta wing, single engine, multi-role supersonic fighter with 
internal weapons carriage. A mathematical model of a tailess jet fighter that was developed under is the Innovative 
Control Effectors (ICE) program”’^"* is used in this rqiort to demonstrate abilities of a re-configurable continuous 
SMC to address aerodynamic surface failures and actuator saturation during execution of flight maneuvers. The ICE 
configuration includes a large suite of conventional and innovative control effectors. The conventional effectors 
include elevens, pitch flaps, thrust vectoring, and outboard leading edge flaps. The innovative control effectors 
include spoiler-slot deflectors and all-moving tips. The following linear model is generated at Mach o.4 and 15,000 
ft. altitude for demonstration of the re-configurable SMC technique. Note that thrust vectoring, spoiler-slot 
deflectors and outboard leading edge flaps are assumed out of order and are not included in the model. 


a 


■-0.6344 

0.0027 

0 

0.9871 

0 

a 

P 


0 

-0.0038 

0.1540 

0 

-0.9876 

P 

P 


0 

-8.2125 

-0.7849 

0 

0.1171 

P 

q 


-0.5971 

0 

0 

-0.5099 

0 

q 

r 


0 

-0.8887 

-0.0299 

0 

-0.0156 

r 


'bn 

-0.0459 

Pi 3 

-0.0133 

-0.0133 ■ 

Pel 

^21 

0.0047 

0 

0.0031 

-0.0031 

Per 

bi\ 

-3.7830 

0 

1.8255 

-1.8255 

bpflap 

^41 

-2.5115 

P 43 

-0.9494 

-0.9494 

^ amt 1 

psi 

-0.0453 

0 

-0.2081 

0.2081 

Poml, 

L ' J 


(162) 


Loss of 50% of a left elevator area and 50% of a pitch flap occur at 5.0 sec and is simulate as follows: 

_r-0.0459, t<5.0s _|-0.0047, t<5.0s ^ _r3.783, t<5.0s 

*"“1-0.02295, r>5.0s’*"“l—0.00237, t>5.0^’“ll.8915, r>5.0s 

_r-2.5114, f<5.0s _f-0.0453, f<5.0j _f-0.0395, r<5.0j ^ ^j-1.9042, t<5.0s 

*'‘^^{-1.2557, t>5.0j’*" “[-0.02265, r>5.0s“[-0.01975, r>5.0s ''^“[-0.9521, f>5.0s 

Five actuators are modeled as follows: 
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( 163 ) 


" 







r -1 





20 

0 

0 

0 

o' 


5c/ 


“1 


L 


0 

20 

0 

0 

0 


5cc 


“2 


^pflap 

= - 

0 

0 

20 

0 

0 


Spflap 

- 

«3 


^ amti 


0 

0 

0 

20 

0 


S amti 


“4 


5 


0 

0 

0 

0 

20 _ 


Slant, 


.“5. 


^ amt^ 






\ 



> 


|5, |<0.5rad, |5,|<lrad/5 Vi = l,5 (164) 

Body rate vector is chosen as command vector 

y = \p,Q,rY. (165) 

The imcommanded variables are defined to be angle of attack a and sideslip angle . 


Angular Rate (Outer Loop) Continuous Finite-Reaching-Time SMC Design. 
The vector 

V = [vi,V2,V3f 


(166) 


is considered as a virtual control to follow given angular rate command profiles (t) = [pc (0.9c (0. ''(OF . where 


B = 


3.7830 

-2.5114 

0.0453 


-3.7830 0 1.8255 -1.8255 

-2.5115 -1.9042 - 0.9494 - 0.9494 

-0.0453 0 - 0.2081 0.2081 


(167) 


The sliding surface is designed in a format eq. (132) as follows: 

t 

a=e + ^jedT = 0, ^ = diag{2,2,2}, a,eeR\ (168) 

0 


where e = {ei,e 2.^3 F > ^nd - p, €2 = qc-q ej = - r are the command angular rate errors. 

The finite-reaching time continuous SMC is designed in a format eqs. (135) and (136) as follows: 

V =v^ (169) 

t. 

where 

v^^=0, .R =diag{7.5,3.0,3.0}; 1 = ^^, • (I'^O) 

Control Allocation 

The designed control law is transformed into the deflection vector-command 5^ 


Sc = , 

by allocation matrix B^, which is calculated using eqs. (137) and (138) with Q = / as follows: 


( 171 ) 


= 


■ 0.1196 

-0.1392 

1.0492 

-0.1196 

-0.1392 

-1.0492 

0 

-0.1055 

0 

0.0260 

-0.0526 

-2.1743 

-0.0260 

-0.0526 

2.1743 


(172) 


The "Very" Inner Re-configurable continuous SMC Design 

The re-configurable continuous SMC u is designed to accurately track the deflection command vector 5^ 

by conqjensation of the actuator in eq. (163) addressing actuator limits in eq. (164). 

The sliding surface is designed in a format eq. (141). This is 


I 

s = T] + C^jT]dT = 0, C^ = 0, s,neR\ 


(173) 


where 77 = { 7 }i, 7 J 2 .t? 3 .^ 4 .^sF. Vi =^ci -5. ' 7 'i- 1.5 . 

The re-configurable continuous SMC is designed in eq. (151) format with boundary layer direct adaptation 
in accordance with eqs. (155), (156), (160) and (161). This is 
A 


- 1 

Ui=Si +— 

as 


Veq, + Pi sat — 


, V =0, p,=l, as =20 Vi = l,5. 


(174) 


Vi = 1,5. 


(176) 


V "-i 

The boundary layers are calculated as follows: 

fij =max|ej,e?,ef} Vi = l,5, (175) 

wha'e 

= +0.005, +0-005 

' ' ' 0.5 - 5,sign j; 

Simulations. The following flying quality was enforced during simulations 

Pc - ^ S^ = - - -, =-. The results of the simulation, that are given in figures 

p* j^+2s + l q s^+3j + 4 t +1.8j+.81 

17-28, show that the prescribed maneuver is accomplished with excellent tracking performance before and after 
damage to the tailess aircraft without violating the actuator limits. 


Conclusions 

A re-configurable two-loop continuous SMC method is developed and successfully applied to a realistic 
tailess aircraft flight control problem. Actuator limit conqiliance is achieved by sliding surface boundary layer direct 
adaptation and demonstrated through high fidelity simulations of a tailess aircraft subject to major damage. 
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Fig. 26 Right all moving tip deflection rate 



Fig. 27 First and second boundary layers 


Fig. 28 Third, fourth and fifth boundary layers 
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Introduction 


This report describes statistical methods for feature extraction and classification of air to ground 
HRR signals. This research is developed under the System Oriented HRR Automatic Target 
Recognition Program (SHARP). The SHARP objective is to design, evaluate, test, and 
demonstrate air-to-ground ATR. Target classification is one of the important tasks in target 
tracking, detection, and recognition within a battlefield scenario. SAR images of moving targets 
are based on the Doppler effect and quite often are not as clear as the images of stationary targets. 
Variable terrain and variances of the observed objects make this task more difficult. 

A detailed overview of moving target data collection and target modeling issues are presented in 
[Wes98]. In this research the Moving Target Acquisition and Recognition program data 
collections 1 & 2, scene 1 were used. These data were produced using X-band 1x1 foot 
resolution SAR images recorded at 15° and 17° depression angles with 360° coverage in aspect 
angle. Each 17° data consists of approximately 250 SAR chips per target and 15° data of 
approximately 195 SAR chips per target Data collected at 15° were used for testing and 17° for 
training. 10 target classes were considered with two target classes (BMP2 and T72) having 
several variants. 

The presentation includes results of research on hybrid distance meastires and piecewise linear 
template representation for feature extraction and classification. This work includes establishing 
hybrid distance measures, describing their statistical properties and tests performed on five-class 
data set to illustrate quality of these measures. A new piecewise linear strategy to represent 
statistical information of the training data is described. It is complemented with developed 
Matlab 5.0 programs for data representation, evaluation of computational cost and quality of 
developed templates and results of test on HRR data for five and ten class problems. Reference 
to the baseline method and comparative results for classification accuracy arc discussed 
throughout the entire report. The presented approach requires no prior knowledge of data 
distribution and provides a reliable performance. As shown in the application results given in this 
report, the developed hybrid measures and piecewise linear representation of the training data 
bring improvement to classification results in most situations. 
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often contains very few samples, usually the concerned region is too small to obtain a statistical 
estimate of normal parameters, such as covariance. 

By the statistical experiments, we find that the minimum distance (called MD) between every two 
samples coming from a uniform distribution has a predictable maximum value, which is in a 
positive proportion to the average minimum distance (AMD) of the cluster. Moreover, the ratio 
of MD over AMD decreases with the increase of the data points number (N) and dimensionality 
(D). When D and N are large, the ratio stays close to specified value. Figl-1 gives the ratio vs. N 
with D = 1-4. These curves are used to determine the threshold Thd_Growth for the cluster 
growth program. 



Fig. 1-1 ratio ofMD over AMD vs. N with D=l,23>4 


1.1.2 CLUSTER MERGE PROGRAM 

After the cluster growth program, we get a lot of small seed clusters, some of which overlap each 
other. The Cluster Merge program selects the clusters to merge based on a similarity between the 
pdf (Possibility Density Function) estimates before and after merging. 

As shown in Fig. 1-2, clusters A and B have their Gaussian distributions~ N{m ^, ) and 

pdfB ~ N{m ^, ). The two distributions form a multi-Gaussian approximation, represented by 

pdfW. Suppose that the points in two clusters are from one Gaussian distribution and should be 
merged. The distribution after merging is another Gaussian, pdJM. The size of the non- 
overlapped area, represented as NOA, shows the difference of pdfs before and after merging. 
When NOA is smaller than some threshold, Thd_Merge, the two clusters are merged. The 
clusters obtained after merging will be checked to merge with other clusters. The cluster merge 
program repeats checking until there are no two clusters whose distributions are so consistent to 
be merged. 
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1.2 Application results 


For simplicity, clustering approach is applied on the two-class problem of 2-D data. The data 
consists of the 2 major features of HRR data extracted by the Singular Value Decomposition 
(SVD). The classifier is basically a Baysian with the concept of rejection. The performance is 
compared with the classification without clustering. 

In our classifier, the concept of rejection is introduced. As shown in Fig. 1-4, there are 3 parts of 
the rejection area. Considering the system noise, such as insufficient calculation accuracy and 
training data, the data points in those areas are rejected without classification. Therefore, there 
are 3 ratios defined in the output: the misclassification ratio Rl, the rejection ratio R2 and the 
correct classification ratio R3, and their sum equals one. 



Fig. 1-4 Deflnition of the rejection area 

Fig. 1-5 gives the Rl of classifications with and without clustering. The whole experimental 
results are appended at the end of the report. 





•Iws«*ftn9 




Fig. 1-5 Comparison of Rl with and without clustering 
(in the classification between Target 3 and every other target) 
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3. Clustering increases R2 in most situations. That is, clustering more closely approximates the 
PDF. The more accurate our approximation of PDF is, the greater a chance we have of 
finding the area where the local probability densities of two classes are close. This is helpful 
to our further improvement of classification. Other methods can be applied to the rejected 
data, and their performance can be accumulated with that of clustering. In this sense, 
clustering is open to be combined with other approaches. 

In spite of the improvement, clustering algorithms has its defects. Since the clustering algorithm 
approximates the PDF according to the data distribution, naturally, the result is dependent on the 
training data. When the data is distributed without obvious clusters, our program can not improve 
the classification result In some situations, a worse result is obtained because of the noise 
introduced by the calculation. Clustering algorithm should therefore be applied selectively. 
Moreover, clustering algorithm is sensitive to changes in the original data. Insufficient data or the 
difference between the training data and test data can ruin the clustering performance. Therefore, 
suitable preprocessing is needed before clustering is applied. 


2 Hybrid Distance Approach 

When observed over the entire aspect angle range HRR signal values vary as shown in Fig. 2-1. 
This variation requires representation of the training data by many templates. Even though 
templates may be set as often as one-degree azimuth angle, it still does not represent a perfect fit 
to the training data. As a result recognition of the test data is difficult and prone to errors. 


HRRTrajectoiy forBTRTO SN#C71 @ deprsssion= 17 



Aspect angle (in degrees) 

Figure 2-1 hKK return signal strength tor various azimuth and range values for BTR70. 
One would expect that if the template is capable to correctly represent properties of the test data, 
then for a template fi-om the same target class the euclidean distance between the template and the 
observed profile should be minimum. Euclidean distance (ED) is defined as: 
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represented as MD_D; in the other dimensions, dominated by noise, distance is computed based 
on the definition of ED, represented as ED_N. HD is the weighted sum of MD_D and ED_N. To 
define HD, we need to resolve the following problems: 

> How to divide these two groups of dimensions? 

> How to adjust the ratio between MD_D and ED_N? 

The dominating dimensions can be selected based on the corresponding singular values. When 
the smgular values are smaller than a special threshold value think that the distribution in 

the related dimension is too concentrated to be considered. 

The second problem is more critical. Basically, we want to keep the emphasis on the character of 
ED_N without losing the adjustment from the MD_D. This is realized by adjusting the singular 
values to construct a new matrix Cmw which will replace the inverse of the covariance matrix 
used in the Mahalanobis Distance. The HD is defined as: 


HD = d»C^ •d’ 

where =(y •diag(s,,"-,s.)*y’) (2-3) 

is the afteradjustmera l</<n 

Different ways to adjust singular values give us several definitions of HD: 

1) HDl measure 

In this definition, we add a noise level to aU the dimensions and then normalize all the 
features by two times the noise level. The noise level is equal to the threshold cr^, which 
separates the noise dimensions and major related dimensions. The calculation of is 
described as follows 


(2-4) 


Nl = CTj 

Si = ((o’, + Nl)/2 • Nl) ^ all the i, \ ^ i ^ m 

= y •diag(si,---,s.)^y' 

2) HD2 measure 

.As discussed, the singular values are precisely the lengths of the semi-axes of the 
hyperellipsoid E. The definition of HD2 tries to keep the shape of the hyperellipsoid E but 
shrinks its volume to a unit size. So all the targets' distributions are normalized and the 
adjustment from the distributions is based on the relationship between singular values. 

The matrix is obtained as follows 


{ 


O’, = O-r 
O’, = cr, 




all the i, when o’, s tjj. 
otherwise 

all the I, I S I £ m 


(2-5) 
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In the next experiment we used the 5-point averaged chip data as testing data instead of 
0_profiles. The result of Experiment 2 is much better than that of Experiment 1, as shown in 
Table 2-2. 



Table 2-2 Result of Experiment 2 

(template: a chip data; observation : 5-point average chip data) 

However, the time cost of the algorithm increases too much to be accepted. To improve the 
efficiency of the algorithm, we merge the neighboring templates to construct a larger template 
OnR by using all the chip data within 10® degree azimuth intervals. This reduces the number of 
templates to be checked for a single observation without shrinking the search space. The 
experiment result (Table 2-3 - Experiment 3) is better than that of Experiment 1, but worse than 
that of Experiment 2. At the same time, its cost is between that of Experiment 1 and 2. These 
three algorithms can be chosen based on the particular requirement of the real problem. (There is 
another configviration which uses large template covering 10° degree intervals and 0_profiles as 
observations. That experiment is much faster than Baseline approach, but its performance is 
worse than that of the Baseline approach. Therefore, that result is omitted in this report.) 



Table 2-3 Result of Experiment 3 

(template: all chip data within 10° degree; observation : 5-point average chip data) 
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Within a specified range of the azimuth angles, this function can be approximated by a 
known functional. The simplest form of this functional is the linear function, that is 

g'{X, azimuth) = a{X) • azimuth 

where a{X) is a constant determined by the distribution of X 
Substituting (3-3) into (3-2), we get 

/'{azimuth) = a{X) • azimuth + c{X) (J-4) 

So, instead of constants, we use linear functions to express the distribution of training 
signatures at specified azimuth angles, as shown by the dashed line in Fig. 3-1. 
Furthermore, the new template model of one target is represented by a group of lines 
instead of a group of constant values used in the template approach. We call it a 



Figure 3-1 Two Template Representations Using the Tr ainin g Data 

The main principle behind the piecewise linear template approximation is to make an explicr. use 
of the training data variation with the azimuth angle by using localized linear regression fits. The 
corresponding algorithm is described next as the PWL algorithm. 

3.1 The PWL algorithm 

1.2.2 Linear regression fit to training data 


Suppose that we have a vector of i-th range bin values X corresponding to various azimuth 


angles, which comprise of the vector A . The expected multidimensional line projected onto this 
range bin dimension minimizes the following equation: 



where 7 = [l,”-,l] R" 


3-5) 
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(3-8) 


Err _P^ =m/mmjzejj P-{fiy • K + •/) 


where P =a» X^\ P P 

To sansfy these equations we must solve the following: 
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(3-9) 


This calculation is repeated for each shift of the profile vector, and the minimum distance 
determined for each one, using distance from a profile vector to the multidimensional line 
segment. 


1.3 ROC curves 

In this section the confusion matrix and receiver operating characteristic (ROC) curves are used 
to describe the performance results of the PWL classifier. The ROC curves show the tradeoff 
between the probability of false alarm P&, probability of declaration Pdec, and the probability of 
correct identification Pec comprehensively, while the confusion matrix summarizes the 
performance estimates at a single operating point. We break the three parameters in ROC curves 
into three sets, represented by three two-dimensional ROC curves. The first ROC curve relates 
conditional Pee given Pjec as a flmction of Pjeeand will be refereed to as ROCl. The second ROC 
curve, referred to as ROC2, relates Pa as a fimetion of Pdec- The last ROC curve, referred to as 
ROC3, shows Pdee vs. Pft. 

Fig.3-3 and 3-4 summarize the performance of PWL classifier for the 10 target classes, compared 
to Baseline approach.. In both methods a single-look, with factor = 1.6, was used for MSTAR 
public targets from collections 1 & 2, scene 1. The overall average results obtained for the 
Baseline approach can be summarized by Pkj=78.7 %, Pdec=97.2 %, Pf,=66 %, while for the PWL 
method the corresponding results were Pi<i=85.7% ( increase by 7%), Pdec=96.8% (decrease by 
0.4°/e), Pfa=58% (decrease by 8%). PWL method used 10 degrees PWL templates with 
observations extracted from 5-point averaged chip data. And the ROC curves show that the PWL 
clasrlfier outperforms the Baseline in most operating points. 
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4. Conclusion 

This report describes a study of statistical methods developed to improve classification 
performance of HRR based target recognition for air-to-ground images of moving targets. The 
clustering program described in this report is a parametric approach. It starts with an estimate of 
the local distribution and efficiently avoids the dependence on having to pre-assume the cluster 
number. Tne clustering program is applied to the HRR data classification and improves most 
experiments. The current application is limited to low dimensionality and a two-class 
classification problem. In our future work, we plan to extend the clustering close to the real 
application. To extend the clustering program to higher dimensionality, more experiments on 
Thd_growth are needed. In addition, designing a multi-class classifier to make use of the 
clustering results is another important topic. 

The piecewise linear method presented uses a localized linear-regression fit to the 
training data. It uses signal alignment and normalization which minimize least square error 
between the observation signals and PWL templates. PWL intervals can be selected 
automatically to minimize the linear regression fit error to the training data. The size of PWL 
intervals was optimized for classification performance. The method shows a noticeable 
improvement over the Baseline algorithm on the test data. 

The HD measures were developed to capture statistically significant features in the signal 
space and use them to determine similarity between templates and test data. It was demonstrated 
that HD measures improve classification performance. The measures depend on information 
noise level that varies with the classification problem and its training data set. This noise level 
was arbitrarily selected to classify signals in HRR/ATR problem. Even though the noise level 
was not optimized in our study, HDs demonstrated an improvement performance over the 
Baseline approach. Using HD measures, we can optimize computational effort by grouping 
training data into clusters of different sizes. Both the cluster sizes and their location can be 
optimized to improve the classification performance. This was done only partially (only the 
cluster sizes were optimized) in our study. Further optimization, which uses our work on 
clustering should be performed. Both methods (PWL and HD) were studied independently and 
each one independently showed an improvement over the Baseline algorithm. Therefore, it is 
recommended that both approaches should be combined to further improve the classification 
results. 
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Abstract: 


This report presents results of research on using clustering approach, hybrid distance measures 
and piecewise linear template representation for feature extraction and classification of air to 
ground HRR signals in a statistical approach. This HRR ATR is developed under the System 
Oriented HRR Automatic Target Recognition, or SHARP program. It includes establishing a 
statistically supported clustering strategy, developing Matlab 5.0 programs for cluster growth and 
merging, evaluating clustering quality for feature extraction and classification of the HRR signals, 
and demonstrating classification accuracy. Next, statistical data representation based on hybrid 
distance measures was developed, and tests performed on five-class data set to illustrate 
properties of these measures. Finally, a new piecewise linear strategy to represent statistical 
information of the training data is described. Both approaches were complemented with 
developed Matlab 5.0 programs for data representation, evaluation of computational cost and 
quality of developed templates and results of test on HRR data for five and ten class problems. 
Reference to the baseline method and comparative results for classification accuracy are 
discussed following the description of each approach. 
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1. Clustering 

The clustering problem is defined as a problem of classifying a group of data points into a 
number of clusters without any prior knowledge. According to [Fun 90], the existing 
clustering methods can be divided into parametric and nonparametric approaches. The 
parametric approach always has a better performance than the nonparametric one. 
However, most parametric techniques require an assumption of cluster number, which is 
not easy to estimate in the real applications. 

In this report, we derive a parametric clustering approach to express data distributions by the 
Multi-Gaussian method. It does not need to pre-assume the number of clusters, making it more 
suitable for our application. 

1 1 DISCRIPTION OF CLUSTERING PROGRAM 

To avoid the dependence on the pre-assumed cluster number, our clustering program starts with 
the estimate of the local distribution. The small clusters, called seed clusters, are constructed by 
the cluster growth program according to the local distribution of training data. Then the cluster 
merge program merges those clusters whose distributions are consistent The merging is 
performed in order to reduce the complexity of data representation as well as to provide a 
statistically supported generalization ability for classification. 


1.1.1 CLUSTER GROWTH PROGRAM 

In the Cluster Growth program, we construct seed clusters in the follo\^Tng way. First, we 
randomly choose one signal to be the beginning of the cluster, and then absorb the nearest 
neighbors of the cluster selectively. Here the nearest neighbor of the cluster is defined as the point 
whose distance to the cluster, or to the nearest point in this cluster, is shortest. This distance will 
be compared with the threshold, Thd_Growth. If it is within Thd_Growth, this point will be 
absorbed and the process will be repeated on the next nearest neighbor, otherwise, the cluster 
growth is terminated. 

The Thd_Growth is selected based on the results of the statistical experiments. In these 
experiments, we assume that data samples in a local neighborhood follow a uniform, rather than 
Gaussian distribution. That is because a cluster constructed by the seed cluster growth program 
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Fig. 1-2 An illustration demonstrating the non-overlapped area 
The threshold is determined by the statistic experiments. In the experiments, we use two groups 
of uniform distributed data to simulate the seed clusters. This assumption doesn’t undermine the 
generality of experimental results. Moreover it is straightward to find whether the two uniform 
models should merge or not. 

The experimental results show the NOA decreases when the means of two clusters get close. And 
this regularity vanishes when the sizes of the two clusters are much different, as shownin Fig.1-3. 
In Fig. R_size is defined as the ratio of two cluster’s sizes. 







As seen in the results, clustering improves most situations. It helps in the following ways: 

1. Clustering can follow the structure of data closely and give a good approximate pdf. It 
divides the data points into clusters according to the valley existing in the density function 
without prior knowledge of data distribution. In other words, this parametric clustering 
method has the same performance as the nonparametric approach. Fig. 1 -6 shows one 
clustering result of Target 3 (when classified with Target 2) 

2. Moreover, clustering is helpful when dealing with overlapped classes. Fig. 1-7 shows the 
overlapped part of Target 3 and Target 12 with clustering results. In the figure, the points 
from Target 3 are presented as dots, and their clusters are shown with the dashed ellipses. 
Five clusters in Target 12 are shown with different symbols (such as circle, star and etc.). As 
seen in this figure, clustering separates points based on their local distribution. This way, 
some points from the non-overwhelming class still have a chance to be classified correctly. 



Fig. 1-7 A clustering result on an overlapped area 
(overlapped part of Target 3 and Target 12) 
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( 2 - 1 ) 


ED = d»d' 
where d = 0-f 

O is the observation signature to classify 
f is the mean of the group of template signatures 

When this is not the case the signal can be misclassified. In addition, if one-degree templates 
represent a good fit to the training data, one would expect that the least square error will be 
almost constant for the profiles near the template azimuth angle. As illustrated in Fig. 2-2 this is 
not the case. This figure shows a large variation of the aligned least square error between of 8- 
point moving average of a single chip data and a template. 

To improve our classification, we consider using Mahalanobis Distance (MD) instead. The 
definition of MD involves the covariance matrix of the template data group: 

MD = d *Ti ‘ * d' (2-2) 

where Z is the covarianceof the group of template signatures 

With the help of the statistical information of template distribution, MD is expected to provide 

better classification result than ED does. 



Figure 2-2 Aligned least square error between of 8-point moving average 

However, MD is not suitable for HRR data. First, its performance is based on the estimate of the 
statistical characters, therefore the requirements for the training data are strict. Second, it just 
provides a resolution for the feature extraction in the linear transformation space of the original 
features. Third, it can't resolve the problem when the observation and template signatures have a 
different feature space that is typical in HRR data. To overcome these shortcomings, we 
developed a mixture of ED and MD, called Hybrid Distance (HD). 

2 .1 Hybrid distance measure 

The hybrid distance (HD) combines the MD and ED in the dominating dimensions. In the 
dominating dimensions, MD is calculated to of statistical information of the data. 
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3) HD3 measure ' 

HD3 is defined based on the comparison to ED. As it is known, 

ED = d •d' 

= d •V •V'»d’=D»D' where D = d»V 

= D = [d„-,d,] 

>•1 

and HD is 

HD = d*C^*d' 

= d •V • diag(s,'\---,s/^)» y •d' 


( 2 - 6 ) 


(2-7) 


To keep the dominating character of ED in the hybrid distance measure, HD3 makes 

N , 

= n . The calculation required for this distance is as follows: 

/=! 



C^ = V*diag{s„-,s,)*V' 


all the i, when (j. s Oj 
otherwise 

all the i, \ a ^ m 


( 2 - 8 ) 


2.2 Application results 

In this section, we applied EQD definitions to HRR data with three different configurations. The 
first experiment uses one chip data to estimate the template model and use O_profiles as testing 
data, which is similar with the configuration of the Baseline approach. This experiment result is 
given in Table 2-1. The last row shows the time cost for each approach. All the HD approaches 
outperform the Baseline in classification performance with the similar time cost. Among them, 
HD3 and HD2 are a little bit better than HDl. 


Table 2-1 Result of Experiment 1 
(template: a chip data; observation : O_profile) 
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Fig.2-3 gives the Receiver Operating Characteristic curves [Wes98] of Experiment 3 
compared with the Baseline performance. As shown in Fig.2-3, all the HD approaches have better 
(PidlPdec) [Wes98]than the Baseline, especially the HD2 and HD3. Their Pf, is similar with that of 
the Baseline for large Pdec [Wes98] and worse than the Baseline for lower P<jec. Similar problem 
has been discussed in the study of PWL algorithm. (Section 3). 
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Figure 2-3 ROC 1&2 of Experiment 3 (HD) 


3 Piecewise Linear Approach 

The baseline classification makes use of the mean value of a chip signatures as the 

template for a specified azimuth and target class. It uses a constant value based on the 

mean of the chip data at a specified azimuth window. 

f {azimuth)=mean{X) . (3-1) 

As a result, for each range value, a horizontal line effectively estimates all signatures X in 

this chip. Even though, data distribution varies wdth respect to the azimuth, the estimate used in 

the template approach does not make an explicit use of this variation. If the relationship between 

the distribution of the training samples and the azimuth value can be estimated and used to 

construct the training model, the classifier performance would be improved. Suppose that a 

training data X varies as a function of azimuth as follows: 

/'{azimuth) = g{X, azimuth) + c{X) (3-2) 

where c(X) is the constant related only to the training signature X, and g(X,azimuth) is a 

complicated function and it is expensive and difficult to get its exact estimation. 
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The solution of this minimization problem yields 2x2 linear equation that provides two linear 
regression fit coefficients kj and bj for each range bin value: 

HI 

^ ‘ ’ (3-( 

V * f ^ 

LOj m ■- .J 

. /■=' J 

Fig. 3-2 gives an example of linear regression fit to the real HRR data of target 2slb01. 



Figure 3-2 Two Template Representations for an example HRR Data 
(target 2sl_b01, feature 58 with azimuth =2.2248) 

Repeating this calculation for various range bins, we can get two vectors, K and B, 
which describe the multidimensional linear regression fit to the training data. These vectors 
are respectively, 

K = [A:,,-.*.] 

B = [6..-.6.] 

where n is the number of range bin 

As a result, training data is described by two constant vectors K and B rather than a single one 
used in the template approach. 

1,2.3 Profile and template line alignment 


The PWL method uses a line equation as a template, which can be expressed by the vectors K, B 
on both sides of this line segment as 

f(a) = a»K + B for a, <a<a„ (3-7) 

The alignment problem is to find the point from this line and the corresponding alignment 

parameters, /I, and , such that the discriminant function (an error function) of the aligned 

template and the profile has a minimum norm. For the convenience of calculation, the three 

alignment parameters are combined and the norm of the error function can be calculated as: 
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Probability of Declaration 

Figure 3-3 ROC Curves 1 and 2 of PWL Approach for 10 Targets 



Probability of False Alarm 

Figure 3-4 ROC 3 of PWL Approach for 10 Targets 

The PWL classifier with this configtiration has a similar computation cost with the Baseline 
approach but a better performance. The process for 5 targets takes both classifiers about 100 
minutes in PC (233MHz). Moreover, we can get a faster algorithm with losing a little 
performance by adjusting the parameters inside the program, such as the number of observation 
data. This scalability of PWL is helpful for the future research and improvement of the 
developed approach. 
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